Asymptotic theory for the multidimensional random 
on-line nearest-neighbour graph 

Andrew R. Wade* 
Department of Mathematics, University of Bristol, 
University Walk, Bristol BS8 1TW, England. 

September 2008 



Abstract 

The on-line nearest-neighbour graph on a sequence of n uniform random points in
$(0,1)^d$ ($d \in \mathbb{N}$) joins each point after the first to its nearest neighbour amongst its
predecessors. For the total power-weighted edge-length of this graph, with weight
exponent $\alpha \in (0, d/2]$, we prove $O(\max\{n^{1-(2\alpha/d)}, \log n\})$ upper bounds on the variance.
On the other hand, we give an $n \to \infty$ large-sample convergence result for the total
power-weighted edge-length when $\alpha > d/2$. We prove corresponding results when the
underlying point set is a Poisson process of intensity n.

Key words and phrases: Random spatial graphs; network evolution; variance asymptotics; 
martingale differences. 

1 Introduction 

The (random) on-line nearest-neighbour graph, which we describe in detail below, is one 
of the simplest models of the evolution of (random) spatial networks. Graphs with an 'on-line'
construction, whereby vertices are added one by one and connected to existing vertices
according to some rule, have recently been the subject of considerable study in relation to the
modelling of real-world networks. Examples of modelling applications include the internet,
social networks, and communications networks in general. The literature is extensive (see 
e.g. [6,11] for surveys), but mostly non-rigorous; rigorous mathematical results are fewer in 
number, even for simple models, and existing results concentrate on graph-theoretic rather 
than geometrical properties (see e.g. [4,5]). 

In recent years, much progress has been made in obtaining large-sample limit theorems 
for functionals defined on graphs in geometric probability, see e.g. [2,3,9,12-14,17,18]. The 



* e-mail: Andrew.Wade@bris.ac.uk 






graphs in question are locally determined in a certain sense. A natural functional of interest 
is the total (Euclidean) edge length of the graph, or, more generally, the total power-weighted 
edge-length, i.e. the sum of the $\alpha$-powers of each edge length for a fixed weight exponent
$\alpha > 0$. The on-line nearest-neighbour graph (ONG) is of particular theoretical interest since
its total power-weighted length functional has both normal and non-normal limiting regimes,
depending on the exponent $\alpha$. (Another example of such a graph was given in [15], but there
spatial boundary effects were crucial.) Moreover, the complete central limit theorem for the 
ONG seems just beyond reach of existing general results such as those of [3, 13, 14, 17] which 
employ various concepts of 'stabilization'. 

The ONG is constructed on points arriving sequentially in $\mathbb{R}^d$ by connecting each point
(vertex) after the first to its nearest (in the Euclidean sense) predecessor. Many real-world 
networks have certain characteristics in common, including spatial structure, localization 
(connections tend to join nearby nodes), and sequential growth (the network evolves over 
time by the addition of new nodes). The ONG is one of the simplest models of spatial 
network evolution that captures these features. 

The ONG appeared in [4] as a growth model of the world wide web graph (for d = 2), 
as a simplified version of the so-called FKP network model [7]. [4] studied, amongst other 
things, the vertex-degree distribution of the ONG. Here we are concerned with geometrical 
properties: in particular, the large-sample asymptotic behaviour of the total power-weighted 
edge length of the ONG on uniform random points in the unit cube $(0,1)^d$, $d \in \mathbb{N} := \{1, 2, 3, \ldots\}$.

In the present paper, we add to previous work on the ONG. In [19], explicit laws 
of large numbers were given for the total power-weighted length of the random ONG in 
$(0,1)^d$, via an application of general results from [18]. [13,16] gave a partial classification of
the distributional limits of the power-weighted length of the ONG on uniform random points
in $(0,1)^d$. In particular, when $d = 1$, for exponent $\alpha > 1/2$, [16] showed, by a 'divide-and-conquer'
approach (and the 'contraction method' [10]), that the limiting distribution of the
centred total power-weighted length of the ONG is described in terms of a distributional 
fixed-point equation. In particular, these distributional limits are not Gaussian. 

It is natural to look for central limit theorems (CLTs), i.e. to prove that, for general
dimensions $d \in \mathbb{N}$ and suitable values of $\alpha$, the total weight, centred and appropriately
scaled, converges in distribution to a Gaussian limit. Penrose [13] gave such a CLT for
$d \in \mathbb{N}$ and $\alpha \in (0, d/4)$: see Section 2 below. As stated in [13,16], it is suspected that a CLT
holds throughout $\alpha \in (0, d/2]$. One contribution of the present paper is to give variance
upper bounds for the total power-weighted edge length of the ONG for $\alpha \in (0, d/2]$. These
upper bounds are believed to be tight, and are consistent with the conjectured central limit 
theory. Our methods for estimating variances are based on a martingale difference approach, 
and delicate estimates of changes in the power-weighted length of the ONG on re-sampling 
a particular vertex. 

We also give a convergence in distribution result for the total power-weighted length of 
the ONG, centred as necessary, for $\alpha > d/2$. This improves on an earlier result from [16],
where such a result was given for $\alpha > d$. We prove this result via a refinement of the
martingale difference technique that yields the variance bounds. 

Intuition behind the $\alpha = d/2$ phase transition in the limiting behaviour is provided by
the fact that increasing the weight exponent $\alpha$ increases the relative importance of longer
edges; for large enough $\alpha$ this amplifies the inhomogeneities in the structure of the ONG
('old' edges tend to be much longer) and so destroys the Gaussian behaviour.

In the next section we give a formal definition of the model and state our main results. 

2 Definitions and results 

Let $d \in \mathbb{N}$. Let $(\mathbf{X}_1, \mathbf{X}_2, \ldots)$ be a sequence of points in $(0,1)^d$. For $n \in \mathbb{N}$, let $\mathcal{X}_n$ denote
the finite sequence $(\mathbf{X}_1, \ldots, \mathbf{X}_n)$. The on-line nearest-neighbour graph (ONG) on vertex
set $\{\mathbf{X}_1, \ldots, \mathbf{X}_n\}$ is constructed by joining each point of $\mathcal{X}_n$ after the first by an edge to
its nearest neighbour amongst those points that precede it in the sequence. That is, for
$i = 2, \ldots, n$ we join $\mathbf{X}_i$ by a directed edge $(\mathbf{X}_i, \mathbf{X}_j)$ to $\mathbf{X}_j$, $1 \le j < i$, satisfying

$$\|\mathbf{X}_j - \mathbf{X}_i\| = \min_{1 \le k < i} \|\mathbf{X}_k - \mathbf{X}_i\|,$$

where $\|\cdot\|$ denotes the Euclidean norm on $\mathbb{R}^d$. We use lexicographic order on $\mathbb{R}^d$ to break
any ties. The resulting directed graph is the ONG on $\mathcal{X}_n$, denoted ONG($\mathcal{X}_n$).
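The construction above can be sketched in a few lines of Python. This is only an illustrative brute-force version, not code from the paper: the names `ong_edges` and `total_power_weighted_length` are hypothetical, points are assumed to be tuples of floats, and the tie-break is assumed to pick the lexicographically smallest candidate predecessor (the text does not say which direction the lexicographic order is applied in).

```python
import math
import random


def ong_edges(points):
    """Return the directed ONG edges (i, j) with j < i (0-based indices).

    Each point after the first is joined to its nearest predecessor in the
    Euclidean norm. Ties are broken by the lexicographic order of the
    candidate predecessors' coordinates (an assumption of this sketch).
    """
    edges = []
    for i in range(1, len(points)):
        # Minimise over (distance, coordinates, index); the coordinate
        # tuple implements the lexicographic tie-break.
        _, _, j = min(
            (math.dist(points[i], points[k]), tuple(points[k]), k)
            for k in range(i)
        )
        edges.append((i, j))
    return edges


def total_power_weighted_length(points, alpha):
    """Sum of the alpha-powers of the ONG edge lengths."""
    return sum(
        math.dist(points[i], points[j]) ** alpha for i, j in ong_edges(points)
    )


if __name__ == "__main__":
    rng = random.Random(0)
    sample = [(rng.random(), rng.random()) for _ in range(200)]
    print(total_power_weighted_length(sample, alpha=1.0))
```

The brute-force search costs O(n^2) distance evaluations, which is adequate for small experiments; a spatial index would be needed for large n.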

It is sometimes more convenient to view the ONG as an undirected graph, by ignoring the 
directedness of the edges. From this perspective ONG($\mathcal{X}_n$) is a tree; in view of the directed
graph picture, it can be seen as rooted at $\mathbf{X}_1$.

From now on we take the points $\mathbf{X}_1, \mathbf{X}_2, \ldots$ to be random. On an underlying probability
space $(\Omega, \mathcal{F}, P)$, let $(\mathbf{U}_1, \mathbf{U}_2, \ldots)$ be a sequence of independent uniformly distributed random
vectors in $(0,1)^d$. For $n \in \mathbb{N}$, let $\mathcal{U}_n := (\mathbf{U}_1, \ldots, \mathbf{U}_n)$. The points $\{\mathbf{U}_1, \ldots, \mathbf{U}_n\}$ of the
sequence $\mathcal{U}_n$ then constitute a binomial point process consisting of n independent uniform
random vectors in $(0,1)^d$.

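For $\mathbf{x} \in \mathbb{R}^d$ and $\mathcal{X} \subseteq \mathbb{R}^d$, let $d(\mathbf{x}; \mathcal{X}) := \inf_{\mathbf{y} \in \mathcal{X} \setminus \{\mathbf{x}\}} \|\mathbf{x} - \mathbf{y}\|$ denote the distance from
$\mathbf{x}$ to its Euclidean nearest neighbour in $\mathcal{X} \setminus \{\mathbf{x}\}$. For $d \in \mathbb{N}$ and $\alpha > 0$, define the total
power-weighted edge length of ONG($\mathcal{U}_n$) by $\mathcal{O}^{d,\alpha}(\mathcal{U}_1) := 0$ and for $n \ge 2$

$$\mathcal{O}^{d,\alpha}(\mathcal{U}_n) := \sum_{i=2}^{n} \left( d(\mathbf{U}_i; \{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}) \right)^{\alpha}.$$

Also, define the centred version $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) := \mathcal{O}^{d,\alpha}(\mathcal{U}_n) - E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)]$. We are interested in
the behaviour of $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$ as $n \to \infty$.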

We also consider the ONG defined on a Poisson number of points. Let $(N(t); t \ge 0)$ be
the counting process of a homogeneous Poisson process of unit rate in $(0, \infty)$, independent
of $(\mathbf{U}_1, \mathbf{U}_2, \ldots)$. Thus for $\lambda > 0$, $N(\lambda)$ is a Poisson random variable with mean $\lambda$. With $\mathcal{U}_n$
as defined above, for $\lambda > 0$ set $\mathcal{P}_\lambda := \mathcal{U}_{N(\lambda)}$. In the Poisson case, we again use the notation
$\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda) = \mathcal{O}^{d,\alpha}(\mathcal{P}_\lambda) - E[\mathcal{O}^{d,\alpha}(\mathcal{P}_\lambda)]$ for the (deterministically) centred version. Note that
the points of the sequence $\mathcal{P}_\lambda$ constitute a homogeneous (marked) Poisson point process of
intensity $\lambda$ on $(0,1)^d$. In this 'Poissonized' version of the ONG, we are again interested in
the large-sample asymptotics, i.e. the limit $\lambda \to \infty$.
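The Poissonized sequence can be simulated directly from its description: count the arrivals of a unit-rate Poisson process up to time $\lambda$, then keep that many points of an i.i.d. uniform sequence. The sketch below is illustrative only; the function names are hypothetical, and `random.Random.random()` samples from $[0,1)$ rather than $(0,1)$, a difference of probability zero in the idealised model.

```python
import random


def poisson_count(lam, rng):
    """N(lam): number of arrivals of a unit-rate Poisson process in (0, lam].

    Arrival times are partial sums of independent Exp(1) inter-arrival times.
    """
    count = 0
    t = rng.expovariate(1.0)
    while t <= lam:
        count += 1
        t += rng.expovariate(1.0)
    return count


def poissonized_sequence(lam, d, rng):
    """Sample the first N(lam) points of an i.i.d. uniform sequence in the unit d-cube."""
    n = poisson_count(lam, rng)
    return [tuple(rng.random() for _ in range(d)) for _ in range(n)]


if __name__ == "__main__":
    rng = random.Random(0)
    points = poissonized_sequence(lam=100.0, d=2, rng=rng)
    print(len(points))
```

Keeping the points in their order of generation preserves the arrival order (the marks), which is what the ONG construction needs.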

For $d \in \mathbb{N}$ let $v_d$ denote the volume of the unit-radius Euclidean $d$-ball, i.e.

$$v_d := \pi^{d/2} \left[ \Gamma(1 + (d/2)) \right]^{-1};$$

see e.g. equation (6.50) of [8]. The following result summarizes previous work (see Theorem
4 of [19] and Theorem 2.1 of [16]) on the first-order behaviour of $\mathcal{O}^{d,\alpha}(\mathcal{U}_n)$. Here and
subsequently '$\xrightarrow{L^p}$' denotes convergence in $L^p$-norm, $p \ge 1$.
Proposition 2.1 [16, 19] Let $d \in \mathbb{N}$. For $\alpha \in (0, d)$, as $n \to \infty$

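For $\alpha = d$, as $n \to \infty$

$$E[\mathcal{O}^{d,d}(\mathcal{U}_n)] \sim v_d^{-1} \log n.$$

For $\alpha > d$, there exists $\mu(d, \alpha) \in (0, \infty)$ such that as $n \to \infty$

$$E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)] \to \mu(d, \alpha).$$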

Remarks. (a) In the particular case $d = 1$, Proposition 2.1 of [16] gives

$$\mu(1, \alpha) = \frac{2}{\alpha(\alpha + 1)} \left( 1 + \frac{2^{-\alpha}}{\alpha - 1} \right), \quad (\alpha > 1).$$

(b) These results carry over to the Poisson point process case with $\mathcal{O}^{d,\alpha}(\mathcal{P}_n)$: this observation
follows from now well-known 'Poissonization' methods.

Second-order (i.e. convergence in distribution) results for $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$ and $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda)$ were
given in [13,16]. Specifically, Theorem 3.6 of Penrose [13] gives a CLT for $\alpha \in (0, d/4)$ and
Theorem 2.1(ii) of [16] gives convergence to a non-Gaussian limit for $\alpha > d$. We summarize
these results in Proposition 2.2 below. Denote by $\mathcal{N}(0, \sigma^2)$ the normal distribution with
mean 0 and variance $\sigma^2 \ge 0$; this includes the degenerate case $\mathcal{N}(0, 0) \equiv 0$. Here and
subsequently '$\xrightarrow{d}$' denotes convergence in distribution.

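Proposition 2.2 Suppose $d \in \mathbb{N}$.

(i) Suppose $\alpha \in (0, d/4)$. Then [13] there exist constants $\sigma^2_{d,\alpha} \in [0, \infty)$ and $\delta^2_{d,\alpha} \in [0, \sigma^2_{d,\alpha}]$
such that

$$\lim_{\lambda \to \infty} \lambda^{(2\alpha - d)/d} \mathrm{Var}[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda)] = \sigma^2_{d,\alpha}; \qquad \lim_{n \to \infty} n^{(2\alpha - d)/d} \mathrm{Var}[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)] = \sigma^2_{d,\alpha} - \delta^2_{d,\alpha}, \tag{1}$$

and as $\lambda, n \to \infty$

$$\lambda^{(2\alpha - d)/(2d)} \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda) \xrightarrow{d} \mathcal{N}(0, \sigma^2_{d,\alpha}); \qquad n^{(2\alpha - d)/(2d)} \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) \xrightarrow{d} \mathcal{N}(0, \sigma^2_{d,\alpha} - \delta^2_{d,\alpha}). \tag{2}$$

(ii) Suppose $\alpha > d$. Then [16] there exists a mean-zero non-Gaussian random variable
$Q(d, \alpha)$ such that as $n \to \infty$

$$\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) \to Q(d, \alpha), \tag{3}$$

where the convergence is almost sure and in $L^p$, for any $p \ge 1$.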
It is conjectured (see [13,16]) that the CLTs of Proposition 2.2(i) are in fact valid for all
$\alpha \in (0, d/2)$:

Conjecture 2.1 [13, 16] Suppose $d \in \mathbb{N}$. The limit theorems (1) and (2) are also valid for
$\alpha \in [d/4, d/2)$.

In ongoing work, we have made some progress towards Conjecture 2.1 but do not yet
have a proof.

The first main result of the present paper, Theorem 2.1 below, provides a version of
the variance upper bounds in (1) for all $\alpha \in (0, d/2]$. Theorem 2.1 is thus consistent with
Conjecture 2.1, and the bounds in Theorem 2.1 are believed to be sharp (up to a constant
factor).

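Theorem 2.1 Suppose $d \in \mathbb{N}$.

(i) For $\alpha \in (0, d/2)$, there is a constant $C \in (0, \infty)$ such that for all $n \in \mathbb{N}$, $\lambda \ge 1$

$$\mathrm{Var}[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)] \le C n^{1 - (2\alpha/d)}, \qquad \mathrm{Var}[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda)] \le C \lambda^{1 - (2\alpha/d)}. \tag{4}$$

(ii) There is a constant $C \in (0, \infty)$ such that for all $n \in \mathbb{N}$, $\lambda \ge 1$

$$\mathrm{Var}[\tilde{\mathcal{O}}^{d,d/2}(\mathcal{U}_n)] \le C \log(1 + n), \qquad \mathrm{Var}[\tilde{\mathcal{O}}^{d,d/2}(\mathcal{P}_\lambda)] \le C \log(1 + \lambda). \tag{5}$$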

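Our second main result extends (3) to all $\alpha > d/2$ and also to the Poisson case.

Theorem 2.2 Suppose $d \in \mathbb{N}$ and $\alpha > d/2$. Then there exists a mean-zero random variable
$Q(d, \alpha)$ (which is non-Gaussian for $\alpha > d$) such that:

(i) as $n \to \infty$

$$\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) \xrightarrow{L^2} Q(d, \alpha); \tag{6}$$

(ii) and, with the coupling of $\mathcal{U}_n$ and $\mathcal{P}_n$ given by $\mathcal{P}_n := \mathcal{U}_{N(n)}$,

$$\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_n) \xrightarrow{L^2} Q(d, \alpha). \tag{7}$$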

Remarks. (a) The fact that for $\alpha > d$ the random variables $Q(d, \alpha)$ in (3) and Theorem
2.2 are not normal follows since convergence also holds without any centring; see Theorem
2.1(ii) of [16]. In the special case $d = 1$, a weaker version of (6), with convergence in
distribution only, was given for $\alpha > 1/2$ in Theorem 2.2 of [16]. In the $d = 1$ case, more
information can be obtained about the distribution of $Q(1, \alpha)$ using a 'divide-and-conquer'
technique; see [16], in particular Theorem 2.2, where the distribution of $Q(1, \alpha)$, $\alpha > 1/2$, is
given (in the binomial setting, and the result carries over to the Poisson setting by Theorem
2.2 here). Indeed, $Q(1, \alpha)$, $\alpha > 1/2$, is given by the unique solution to a distributional
fixed-point equation, and in particular is not Gaussian; see [16] for details. We suspect that
$Q(d, \alpha)$ is non-Gaussian for $\alpha \in (d/2, d]$ also for $d \ge 2$.

(b) A closely related 'directed' version of the one-dimensional ONG is the 'directed
linear tree' introduced in [15], in which each point in a sequence of points in (0,1) is
joined to its nearest predecessor to the left. Following the methods of the present paper,
one can obtain results for that model analogous to the $d = 1$ cases of all those in this section.

Theorem 2.1(ii) suggests that the case $\alpha = d/2$ is of a special nature. Moreover, the case
$d = 2$, $\alpha = 1$ is of natural interest, where we have the total Euclidean length of the ONG on
random points in $(0,1)^2$. We conjecture the following.

Conjecture 2.2 Let $d \in \mathbb{N}$. There exists a constant $\sigma^2_{d,d/2} \in (0, \infty)$ such that

$$(\log n)^{-1/2} \tilde{\mathcal{O}}^{d,d/2}(\mathcal{U}_n) \xrightarrow{d} \mathcal{N}(0, \sigma^2_{d,d/2}), \quad \text{as } n \to \infty.$$

The proof (or refutation) of Conjecture 2.2 seems to be a challenging open problem.

The structure of the remainder of the paper is as follows. In Section 3 we give some
preparatory results on the properties of the ONG. In Section 4 we use a martingale difference
technique to prove Theorem 2.1. In Section 5 we refine the martingale difference technique
to give a proof of Theorem 2.2.



3 Preliminaries 

First we introduce some more notation. Let card($\mathcal{X}$) denote the cardinality (number of
elements) of a finite set $\mathcal{X}$, and let $\mathbf{0}$ be the origin of $\mathbb{R}^d$ ($d \in \mathbb{N}$). For measurable $R \subseteq \mathbb{R}^d$,
let $|R|$ denote the $d$-dimensional Lebesgue measure of $R$. Let $\mathrm{diam}(R) = \sup_{\mathbf{x}, \mathbf{y} \in R} \|\mathbf{x} - \mathbf{y}\|$
denote the (Euclidean) diameter of a bounded set $R \subseteq \mathbb{R}^d$. Let $B(\mathbf{x}; r)$ be the (closed)
Euclidean $d$-ball with centre $\mathbf{x} \in \mathbb{R}^d$ and radius $r > 0$.

In the analysis in Sections 4 and 5 below, we will need detailed properties of the change
in total weight of the ONG on $\mathcal{U}_n$ when the point $\mathbf{U}_i$, $i \in \{1, \ldots, n\}$, is independently
re-sampled, i.e., replaced by an independent copy $\mathbf{U}_i'$. The changes due to edges incident to
$\mathbf{U}_i$ require most work to deal with. To study these, we make use of the fact that an edge
from $\mathbf{U}_j$ with $j > i$ can only be incident to $\mathbf{U}_i$ if $\mathbf{U}_j$ falls in the Voronoi cell of $\mathbf{U}_i$ with
respect to $\{\mathbf{U}_1, \ldots, \mathbf{U}_i\}$. Hence the preliminary results in this section begin with an analysis
of such Voronoi cells.

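The next lemma gives bounds on the expected diameter of Voronoi cells in $(0,1)^d$ with
respect to $\mathcal{U}_n$. For $n \in \mathbb{N}$, let $\mathcal{V}_n(\mathbf{x})$ be the Voronoi cell of $\mathbf{x} \in (0,1)^d$ with respect to
$\{\mathbf{x}, \mathbf{U}_1, \ldots, \mathbf{U}_n\}$:

$$\mathcal{V}_n(\mathbf{x}) := \left\{ \mathbf{y} \in (0,1)^d : \|\mathbf{x} - \mathbf{y}\| \le \min_{1 \le i \le n} \|\mathbf{y} - \mathbf{U}_i\| \right\} \subseteq (0,1)^d. \tag{8}$$

Lemma 3.1 Let $d \in \mathbb{N}$, $\beta > 0$. Then there exists $C \in (0, \infty)$ such that for all $n \in \mathbb{N}$

$$\sup_{\mathbf{x} \in (0,1)^d} E[(\mathrm{diam}(\mathcal{V}_n(\mathbf{x})))^{\beta}] \le C n^{-\beta/d}.$$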
We will prove Lemma 3.1 using a construction of overlapping and nested cones from
p. 1027 of [14]. The argument works for an arbitrary convex set, not just $(0,1)^d$, but here
we only need the latter.

For $d \in \{2, 3, \ldots\}$, we can (and do) choose $I \in \mathbb{N}$ and construct $C_i$, $1 \le i \le I$, a
finite collection of infinite closed cones in $\mathbb{R}^d$ with angular radius $\pi/12$ and apex at $\mathbf{0}$, with
$\bigcup_{i=1}^{I} C_i = \mathbb{R}^d$. Let $C_i(\mathbf{x})$ be the translate of $C_i$ with apex at $\mathbf{x} \in \mathbb{R}^d$. Let $C_i^+(\mathbf{x})$ be the closed
cone with apex and principal axis coincident with those of $C_i(\mathbf{x})$ but with angular radius
$\pi/6$. When $d = 1$, we take $I = 2$ and let $C_1 = [0, \infty)$, $C_2 = (-\infty, 0]$, and for $x \in \mathbb{R}$ set
$C_1(x) = C_1^+(x) = [x, \infty)$ and $C_2(x) = C_2^+(x) = (-\infty, x]$.

Let $d \in \mathbb{N}$. For $\mathbf{x} \in \mathbb{R}^d$ and $r > 0$, let $C_i(\mathbf{x}; r) := C_i(\mathbf{x}) \cap B(\mathbf{x}; r)$ and $C_i^+(\mathbf{x}; r) :=
C_i^+(\mathbf{x}) \cap B(\mathbf{x}; r)$. For $n \in \mathbb{N}$, define the event

$$E_n(\mathbf{x}; r) := \bigcap_{i : 1 \le i \le I,\ \mathrm{diam}(C_i(\mathbf{x}; r) \cap (0,1)^d) = r} \{ \mathcal{U}_n \cap C_i^+(\mathbf{x}; r) \ne \emptyset \},$$

with the convention that an empty intersection is $\Omega$. Then $E_n(\mathbf{x}; r) \subseteq E_{n+1}(\mathbf{x}; r)$, and for
$s > r$, $E_n(\mathbf{x}; r) \subseteq E_n(\mathbf{x}; s)$. For $\mathbf{x} \in (0,1)^d$, set

$$R_n(\mathbf{x}) := \inf\{ r > 0 : E_n(\mathbf{x}; r) \text{ occurs} \}. \tag{9}$$

Note that a.s., $R_n(\mathbf{x}) \le d^{1/2}$. The next lemma is the main step in the proof of Lemma 3.1.

Lemma 3.2 Suppose $d \in \mathbb{N}$. For $\beta > 0$ there exists $C \in (0, \infty)$ such that for all $n \in \mathbb{N}$

$$\sup_{\mathbf{x} \in (0,1)^d} E[R_n(\mathbf{x})^{\beta}] \le C n^{-\beta/d}.$$

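Proof. For $\mathbf{x} \in (0,1)^d$ and $r > 0$, $P(R_n(\mathbf{x}) > r) \le P(E_n(\mathbf{x}; r)^c)$, so that

$$P(R_n(\mathbf{x}) > r) \le P\left( \bigcup_{i : 1 \le i \le I,\ \mathrm{diam}(C_i(\mathbf{x}) \cap (0,1)^d) \ge r} \{ \mathcal{U}_n \cap C_i^+(\mathbf{x}; r) = \emptyset \} \right), \tag{10}$$

with the convention that an empty union is empty. Suppose $r \in (0, d^{1/2}]$. For any $i$ with
$\mathrm{diam}(C_i(\mathbf{x}) \cap (0,1)^d) \ge r$, we can by convexity choose a (non-random) $\mathbf{z} \in C_i(\mathbf{x}) \cap (0,1)^d$ at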
distance r/2 from x. Then (since | < | sin ^) we have that S(z; r/4) fl (0, l) d is contained in 
$C_i^+(\mathbf{x}; r)$ and, since $r \le d^{1/2}$, has $|B(\mathbf{z}; r/4) \cap (0,1)^d| \ge C r^d$ for some $C \in (0, \infty)$ depending
only on $d$. Hence for any $i$ with $\mathrm{diam}(C_i(\mathbf{x}) \cap (0,1)^d) \ge r$,

$$P(\mathcal{U}_n \cap C_i^+(\mathbf{x}; r) = \emptyset) \le P(\mathcal{U}_n \cap B(\mathbf{z}; r/4) \cap (0,1)^d = \emptyset) \le (1 - C r^d)^n, \tag{11}$$

for some $C \in (0, \infty)$ depending only on $d$. Applying Boole's inequality in (10), using (11),
and noting that $1 - x \le e^{-x}$ for any $x \ge 0$, we have that there are constants $C, C' \in (0, \infty)$,
depending only on $d$, such that for all $r > 0$ and $n \in \mathbb{N}$

$$\sup_{\mathbf{x} \in (0,1)^d} P(R_n(\mathbf{x}) > r) \le C' \exp(-C n r^d).$$
Hence for $\beta > 0$ and $n \in \mathbb{N}$, setting $s = C n r^{d/\beta}$,

$$\sup_{\mathbf{x} \in (0,1)^d} E[R_n(\mathbf{x})^{\beta}] = \sup_{\mathbf{x} \in (0,1)^d} \int_0^\infty P(R_n(\mathbf{x}) > r^{1/\beta})\, dr \le C' \int_0^\infty \exp(-C n r^{d/\beta})\, dr$$

$$\le C'' n^{-\beta/d} \int_0^\infty s^{(\beta/d) - 1} \exp(-s)\, ds = C'' n^{-\beta/d}\, \Gamma(\beta/d),$$

using Euler's Gamma integral (see e.g. 6.1.1 in [1]) for the last equality. □
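As a numerical sanity check on the last step of this proof, the substitution $s = C n r^{d/\beta}$ gives $\int_0^\infty \exp(-C n r^{d/\beta})\, dr = (\beta/d)(Cn)^{-\beta/d}\, \Gamma(\beta/d)$, which is where the $n^{-\beta/d}$ rate comes from. The sketch below compares a simple midpoint-rule approximation with this closed form; the function names and parameter values are illustrative choices, not taken from the paper.

```python
import math


def integral_exp_power(c, n, d, beta, steps=100_000):
    """Midpoint-rule approximation of int_0^inf exp(-c * n * r**(d/beta)) dr.

    The integral is truncated where the integrand has dropped below exp(-50).
    """
    upper = (50.0 / (c * n)) ** (beta / d)
    h = upper / steps
    return h * sum(
        math.exp(-c * n * ((k + 0.5) * h) ** (d / beta)) for k in range(steps)
    )


def closed_form(c, n, d, beta):
    """(beta/d) * (c*n)**(-beta/d) * Gamma(beta/d), from s = c*n*r**(d/beta)."""
    return (beta / d) * (c * n) ** (-beta / d) * math.gamma(beta / d)


if __name__ == "__main__":
    for c, n, d, beta in [(1.0, 10, 2, 1.0), (0.5, 100, 2, 3.0)]:
        print(integral_exp_power(c, n, d, beta), closed_form(c, n, d, beta))
```

Doubling n divides the closed form by $2^{\beta/d}$, in line with the bound of Lemma 3.2.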

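Now we can complete the proof of Lemma 3.1.
Proof of Lemma 3.1. With $R_n(\mathbf{x})$ as defined at (9) we claim that

$$\mathrm{diam}(\mathcal{V}_n(\mathbf{x})) \le 2 R_n(\mathbf{x}) \tag{12}$$

for all $\mathbf{x} \in (0,1)^d$ and all $n \in \mathbb{N}$. Thus for $\beta > 0$, $E[(\mathrm{diam}(\mathcal{V}_n(\mathbf{x})))^{\beta}] \le C E[(R_n(\mathbf{x}))^{\beta}] \le
C' n^{-\beta/d}$, by Lemma 3.2, proving Lemma 3.1.

To verify the claim (12), suppose that $\mathbf{y} \in (0,1)^d$ lies at distance $s > r = R_n(\mathbf{x})$
from $\mathbf{x}$. Then we can choose $i$ such that $\mathbf{y} \in C_i(\mathbf{x})$, so clearly $\mathrm{diam}(C_i(\mathbf{x}) \cap (0,1)^d) \ge r$
and $\mathrm{diam}(C_i(\mathbf{x}; r) \cap (0,1)^d) = r$. By definition of $R_n(\mathbf{x})$ we must have some point of
$\mathcal{U}_n \cap C_i^+(\mathbf{x}; r)$; but then this point lies closer to $\mathbf{y}$ than $\mathbf{x}$ does, so $\mathbf{y}$ is not in the Voronoi cell
$\mathcal{V}_n(\mathbf{x})$. Thus $\sup_{\mathbf{y} \in \mathcal{V}_n(\mathbf{x})} \|\mathbf{x} - \mathbf{y}\| \le R_n(\mathbf{x})$. Then the triangle inequality implies the result. □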

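Next we establish the results that we will need later to control the changes in the ONG
on re-sampling a vertex. Let $D \subseteq \mathbb{R}^d$ be a measurable, non-null convex region and let $\mathbf{x} \in D$.
Let $(\mathbf{X}_1, \mathbf{X}_2, \ldots)$ be a sequence of independent uniform random points on $D$. We use the
notation $\mathcal{X}_n = (\mathbf{X}_1, \ldots, \mathbf{X}_n)$, and for $\mathbf{x} \in D$ set $\mathcal{X}_n^{\mathbf{x}} := (\mathbf{x}, \mathbf{X}_1, \ldots, \mathbf{X}_n)$. For a finite sequence
$\mathcal{X}$ of points in $\mathbb{R}^d$ and two points $\mathbf{x}, \mathbf{y}$ of $\mathcal{X}$, let $E(\mathbf{x}, \mathbf{y}; \mathcal{X})$ denote the event that $(\mathbf{x}, \mathbf{y})$ is an
edge in the ONG on $\mathcal{X}$. Let $\mathcal{O}_{\mathbf{x}}^{d,\alpha}(D; n)$ denote the total power-weighted length, with weight
exponent $\alpha > 0$, of edges incident to $\mathbf{x}$ in the ONG on sequence $\mathcal{X}_n^{\mathbf{x}}$, i.e.

$$\mathcal{O}_{\mathbf{x}}^{d,\alpha}(D; n) := \sum_{i=1}^{n} \mathbf{1}_{E(\mathbf{X}_i, \mathbf{x}; \mathcal{X}_n^{\mathbf{x}})} \|\mathbf{X}_i - \mathbf{x}\|^{\alpha}. \tag{13}$$

In the special case $D = (0,1)^d$, we will write $\mathbf{U}_i$ for $\mathbf{X}_i$, $\mathcal{U}_n$ for $\mathcal{X}_n$ and $\mathcal{U}_n^{\mathbf{x}}$ for $\mathcal{X}_n^{\mathbf{x}}$, and we
abbreviate notation to

$$\mathcal{O}_{\mathbf{x}}^{d,\alpha}(n) := \mathcal{O}_{\mathbf{x}}^{d,\alpha}((0,1)^d; n).$$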
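Lemma 3.3 Let $d \in \mathbb{N}$. Suppose $\alpha > 0$. There exists $C \in (0, \infty)$ such that

$$\sup_{n \in \mathbb{N}} \sup_{\mathbf{x} \in (0,1)^d} E[\mathcal{O}_{\mathbf{x}}^{d,\alpha}(n)] \le C. \tag{14}$$

Moreover there exists $C \in (0, \infty)$ such that for any $m, n$ with $0 \le m < n$

$$\sup_{\mathbf{x} \in (0,1)^d} E[\mathcal{O}_{\mathbf{x}}^{d,\alpha}(n) - \mathcal{O}_{\mathbf{x}}^{d,\alpha}(m)] \le C (m + 1)^{-\alpha/d}. \tag{15}$$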
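Proof. Fix $d \in \mathbb{N}$. For $i \in \mathbb{N}$ and $\mathbf{x} \in (0,1)^d$, set

$$W_i^{\alpha} := \mathcal{O}_{\mathbf{x}}^{d,\alpha}(i) - \mathcal{O}_{\mathbf{x}}^{d,\alpha}(i - 1) = \mathbf{1}_{E(\mathbf{U}_i, \mathbf{x}; \mathcal{U}_i^{\mathbf{x}})} \|\mathbf{U}_i - \mathbf{x}\|^{\alpha},$$

with the convention $\mathcal{O}_{\mathbf{x}}^{d,\alpha}(0) := 0$. Thus $W_i$ is the length of the edge from $\mathbf{U}_i$ to $\mathbf{x}$ in the
ONG on $\mathcal{U}_i^{\mathbf{x}}$, if such an edge exists, or zero otherwise. Then for $n \in \mathbb{N}$

$$\mathcal{O}_{\mathbf{x}}^{d,\alpha}(n) = \sum_{i=1}^{n} W_i^{\alpha}. \tag{16}$$

Let $i \ge 2$. Given $\{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}$, $W_i > 0$ only if $\mathbf{U}_i$ falls inside the Voronoi cell of $\mathbf{x}$
with respect to $\{\mathbf{x}, \mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}$, that is $\mathcal{V}_{i-1}(\mathbf{x})$ as defined at (8). In addition, given that
$\mathbf{U}_i \in \mathcal{V}_{i-1}(\mathbf{x})$ (an event of probability $|\mathcal{V}_{i-1}(\mathbf{x})|$), we have $W_i \le \mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x}))$. So for $i \ge 2$

$$E[W_i^{\alpha} \mid \mathbf{U}_1, \ldots, \mathbf{U}_{i-1}] = E[W_i^{\alpha} \mathbf{1}_{\{\mathbf{U}_i \in \mathcal{V}_{i-1}(\mathbf{x})\}} \mid \mathbf{U}_1, \ldots, \mathbf{U}_{i-1}]$$

$$\le |\mathcal{V}_{i-1}(\mathbf{x})| (\mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})))^{\alpha} \le (\mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})))^{d + \alpha}. \tag{17}$$

Then taking expectations in (17) we obtain

$$\sup_{\mathbf{x} \in (0,1)^d} E[W_i^{\alpha}] \le \sup_{\mathbf{x} \in (0,1)^d} E[(\mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})))^{d + \alpha}] \le C (i + 1)^{-1 - (\alpha/d)}, \tag{18}$$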

for some $C \in (0, \infty)$ and all $i \in \mathbb{N}$, by Lemma 3.1. Then we obtain (14) by taking
expectations in (16) and using (18). Similarly we obtain (15), this time using the fact that for
$1 \le m < n$

$$\sum_{i=m+1}^{n} E[W_i^{\alpha}] \le C \sum_{i=m+1}^{\infty} (i + 1)^{-1 - (\alpha/d)} \le C' (m + 1)^{-\alpha/d},$$

by (18). This completes the proof. □

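In addition to $\mathcal{O}_{\mathbf{x}}^{d,\alpha}(D; n)$, we consider the related quantity

$$\hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(D; n) := \sum_{i=2}^{n} \mathbf{1}_{E(\mathbf{X}_i, \mathbf{x}; \mathcal{X}_n^{\mathbf{x}})} \left( d(\mathbf{X}_i; \{\mathbf{X}_1, \ldots, \mathbf{X}_{i-1}\}) \right)^{\alpha};$$

that is, the total weight of the edges in the ONG on $\mathcal{X}_n$ from those points that would be
joined to $\mathbf{x}$ in the ONG on $\mathcal{X}_n^{\mathbf{x}}$. In the case $D = (0,1)^d$, we use the abbreviation

$$\hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(n) := \hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}((0,1)^d; n).$$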

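The following result is analogous to Lemma 3.3.

Lemma 3.4 Let $d \in \mathbb{N}$. Suppose $\alpha > 0$. There exists $C \in (0, \infty)$ such that

$$\sup_{n \in \mathbb{N}} \sup_{\mathbf{x} \in (0,1)^d} E[\hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(n)] \le C. \tag{19}$$

Moreover there exists $C \in (0, \infty)$ such that for any $m, n$ with $0 \le m < n$

$$\sup_{\mathbf{x} \in (0,1)^d} E[\hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(n) - \hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(m)] \le C (m + 1)^{-\alpha/d}. \tag{20}$$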
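Proof. The proof is similar to that of Lemma 3.3. For $i \in \{2, 3, 4, \ldots\}$ set

$$W_i^{\alpha} := \hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(i) - \hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(i - 1) = \mathbf{1}_{E(\mathbf{U}_i, \mathbf{x}; \mathcal{U}_i^{\mathbf{x}})} \left( d(\mathbf{U}_i; \{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}) \right)^{\alpha},$$

where we take $\hat{\mathcal{O}}_{\mathbf{x}}^{d,\alpha}(1) := 0$. Then $W_i > 0$ only if $\mathbf{U}_i \in \mathcal{V}_{i-1}(\mathbf{x})$. Given that $\mathbf{U}_i \in \mathcal{V}_{i-1}(\mathbf{x})$, it
follows from the triangle inequality that

$$W_i \le d(\mathbf{U}_i; \{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}) \le d(\mathbf{U}_i; \{\mathbf{x}\}) + d(\mathbf{x}; \{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}) \le C\, \mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})),$$

for some $C \in (0, \infty)$ depending only on $d$. It follows that there exists $C \in (0, \infty)$ such that
for all $i \in \mathbb{N}$

$$E[W_i^{\alpha} \mid \mathbf{U}_1, \ldots, \mathbf{U}_{i-1}] \le C |\mathcal{V}_{i-1}(\mathbf{x})| (\mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})))^{\alpha} \le C (\mathrm{diam}(\mathcal{V}_{i-1}(\mathbf{x})))^{d + \alpha}.$$

Thus by Lemma 3.1, for some $C \in (0, \infty)$ and all $i \in \mathbb{N}$,

$$\sup_{\mathbf{x} \in (0,1)^d} E[W_i^{\alpha}] \le C (i + 1)^{-1 - (\alpha/d)},$$

and the lemma follows. □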

The remaining results of this section will be used later to convert between Poisson and 
binomial results. The first is a technical lemma. 

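Lemma 3.5 Suppose $\beta \ge 1$ and $x > 0$. Then,

$$-\frac{1}{\beta} x^{1 - \beta} \exp(-x^{\beta}) \le \int_0^x \exp(-t^{\beta})\, dt - \Gamma(1 + (1/\beta)) \le 0. \tag{21}$$

Proof. Suppose $\beta \ge 1$ and $x > 0$. We have

$$\int_0^x \exp(-t^{\beta})\, dt = \int_0^\infty \exp(-t^{\beta})\, dt - \int_x^\infty \exp(-t^{\beta})\, dt. \tag{22}$$

We deal with each integral on the right-hand side of (22) separately, using the change of
variable $y = t^{\beta}$. By Euler's Gamma integral (see e.g. 6.1.1 in [1]) we have

$$\int_0^\infty e^{-t^{\beta}}\, dt = \frac{1}{\beta} \int_0^\infty y^{(1/\beta) - 1} e^{-y}\, dy = \frac{1}{\beta} \Gamma(1/\beta) = \Gamma(1 + (1/\beta)). \tag{23}$$

For the second integral on the right-hand side of (22) we have

$$0 \le \int_x^\infty e^{-t^{\beta}}\, dt = \frac{1}{\beta} \int_{x^{\beta}}^\infty y^{(1/\beta) - 1} e^{-y}\, dy \le \frac{1}{\beta} x^{1 - \beta} \int_{x^{\beta}}^\infty e^{-y}\, dy = \frac{1}{\beta} x^{1 - \beta} e^{-x^{\beta}}. \tag{24}$$

Then from (22) with (23) and (24) we obtain (21). □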
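The two-sided bound of Lemma 3.5, namely that $\int_0^x \exp(-t^{\beta})\, dt - \Gamma(1 + 1/\beta)$ lies between $-\beta^{-1} x^{1-\beta} \exp(-x^{\beta})$ and 0 for $\beta \ge 1$ and $x > 0$, can be checked numerically. The following is a rough sketch using a midpoint rule; the function names are hypothetical and the small tolerance in the usage example only absorbs quadrature error (for $\beta = 1$ the lower bound holds with equality).

```python
import math


def truncated_integral(x, beta, steps=100_000):
    """Midpoint-rule approximation of int_0^x exp(-t**beta) dt."""
    h = x / steps
    return h * sum(math.exp(-(((k + 0.5) * h) ** beta)) for k in range(steps))


def lemma_3_5_gap(x, beta):
    """Middle quantity of the bound: int_0^x exp(-t^beta) dt - Gamma(1 + 1/beta)."""
    return truncated_integral(x, beta) - math.gamma(1.0 + 1.0 / beta)


def lemma_3_5_lower(x, beta):
    """Lower bound: -(1/beta) * x**(1 - beta) * exp(-x**beta)."""
    return -(1.0 / beta) * x ** (1.0 - beta) * math.exp(-(x ** beta))


if __name__ == "__main__":
    tol = 1e-7  # allowance for numerical integration error
    for beta in (1.0, 2.0, 4.0):
        for x in (0.5, 1.0, 2.0):
            gap = lemma_3_5_gap(x, beta)
            print(beta, x, lemma_3_5_lower(x, beta) - tol <= gap <= tol)
```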



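To deduce the Poisson parts of Theorems 2.1 and 2.2 we will need some estimates of
incremental expectations, improving upon those in Section 3 of [16]. For $n \in \mathbb{N}$ set

$$Z_n := \mathcal{O}^{d,1}(\mathcal{U}_n) - \mathcal{O}^{d,1}(\mathcal{U}_{n-1}),$$

taking $\mathcal{O}^{d,1}(\mathcal{U}_0) := 0$. Thus $Z_n$ is the gain in length on addition of the nth point in the ONG
on $(\mathbf{U}_1, \mathbf{U}_2, \ldots)$. Then for $n \in \mathbb{N}$

$$\mathcal{O}^{d,\alpha}(\mathcal{U}_n) = \sum_{i=1}^{n} Z_i^{\alpha}. \tag{25}$$

Note that (25) with (27) below implies that for $\alpha \in (0, d)$

$$E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)] = \frac{d}{d - \alpha} v_d^{-\alpha/d}\, \Gamma(1 + (\alpha/d))\, n^{1 - (\alpha/d)} + O(\max\{1, n^{1 - (\alpha/d) - (1/d) + \varepsilon}\}), \tag{26}$$

for any $\varepsilon > 0$, which improves upon the $o(n^{1 - (\alpha/d)})$ error term implicit in Theorem 2.1(i)
of [16].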
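Lemma 3.6 Suppose $d \in \mathbb{N}$ and $\alpha \in (0, d]$. Then for $n \in \mathbb{N}$

$$E[Z_n^{\alpha}] = v_d^{-\alpha/d}\, \Gamma(1 + (\alpha/d))\, n^{-\alpha/d} + h(n), \tag{27}$$

where $h(n) = O(n^{-(\alpha/d) - (1/d) + \varepsilon})$ as $n \to \infty$, for any $\varepsilon > 0$.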

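Proof. Let $d \in \mathbb{N}$. For $r > 0$ and $\mathbf{x} \in (0,1)^d$, set $A(\mathbf{x}; r) := |(0,1)^d \cap B(\mathbf{x}; r)|$. For $n \ge 2$,

$$P(Z_n^{\alpha} > z \mid \mathbf{U}_n) = P(\{\mathbf{U}_1, \ldots, \mathbf{U}_{n-1}\} \cap B(\mathbf{U}_n; z^{1/\alpha}) = \emptyset \mid \mathbf{U}_n) = (1 - A(\mathbf{U}_n; z^{1/\alpha}))^{n-1}.$$

For $r \ge d^{1/2}$, $A(\mathbf{x}; r) = 1$ for all $\mathbf{x} \in (0,1)^d$. Then for $\mathbf{U}_n \in (0,1)^d$,

$$E[Z_n^{\alpha} \mid \mathbf{U}_n] = \int_0^\infty P(Z_n > z^{1/\alpha} \mid \mathbf{U}_n)\, dz = \int_0^\infty (1 - A(\mathbf{U}_n; z^{1/\alpha}))^{n-1}\, dz. \tag{28}$$

Fix $\varepsilon \in (0, 1/d)$ small. For all $n$ large enough so that $n^{\varepsilon - (1/d)} < 1/2$, let $S_n$ denote the region
$[n^{\varepsilon - (1/d)}, 1 - n^{\varepsilon - (1/d)}]^d$. For $\mathbf{x} = (x_1, x_2, \ldots, x_d) \in (0,1)^d$ let $m(\mathbf{x}) := \min\{x_1, \ldots, x_d, 1 - x_1, \ldots, 1 - x_d\}$,
i.e. the shortest distance from $\mathbf{x}$ to the boundary of $(0,1)^d$. Consider $\mathbf{x} \in S_n$.
For $0 < r \le m(\mathbf{x})$, $A(\mathbf{x}; r) = v_d r^d$, and for $r \le d^{1/2}$, $C r^d \le A(\mathbf{x}; r) \le v_d r^d$ for some
$C \in (0, v_d)$ depending only on $d$. Thus from (28)

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] \ge \int_0^{m(\mathbf{U}_n)^{\alpha}} (1 - v_d z^{d/\alpha})^{n-1}\, dz \ge \int_0^{n^{\varepsilon\alpha - (\alpha/d)}} (1 - v_d z^{d/\alpha})^{n-1}\, dz, \tag{29}$$

since $m(\mathbf{U}_n) \ge n^{\varepsilon - (1/d)}$ for $\mathbf{U}_n \in S_n$. For $x > 0$ Taylor's Theorem with Lagrange remainder
implies that $e^{-x} = 1 - x + C x^2$ where $C \in [0, 1/2]$, so for $z \le n^{\varepsilon\alpha - (\alpha/d)}$ and $n$ large enough,
we have that

$$(1 - v_d z^{d/\alpha})^{n-1} \ge \left( \exp(-v_d z^{d/\alpha}) - \tfrac{1}{2} v_d^2 z^{2d/\alpha} \right)^{n-1}$$

$$\ge \exp(-v_d n z^{d/\alpha}) \left( 1 - \tfrac{1}{2} v_d^2 z^{2d/\alpha} \exp(v_d z^{d/\alpha}) \right)^{n-1}$$

$$\ge \exp(-v_d n z^{d/\alpha}) \left( 1 + O\left( n^{2d\varepsilon - 2} \exp(v_d n^{d\varepsilon - 1}) \right) \right)^{n-1}$$

$$= \exp(-v_d n z^{d/\alpha}) (1 + O(n^{2d\varepsilon - 1})),$$

as $n \to \infty$, since $\varepsilon < 1/d$. So from (29) we have that

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] \ge (1 + O(n^{2d\varepsilon - 1})) \int_0^{n^{\varepsilon\alpha - (\alpha/d)}} \exp(-v_d n z^{d/\alpha})\, dz. \tag{30}$$

Now, setting $s = (v_d n)^{\alpha/d} z$, for $\alpha \in (0, d]$

$$\int_0^{n^{\varepsilon\alpha - (\alpha/d)}} \exp(-v_d n z^{d/\alpha})\, dz = (n v_d)^{-\alpha/d} \int_0^{v_d^{\alpha/d} n^{\varepsilon\alpha}} \exp(-s^{d/\alpha})\, ds$$

$$= (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(\exp(-v_d n^{\varepsilon d})), \tag{31}$$

using (21) for the final equality. So we obtain from (30) and (31) that for $\varepsilon > 0$

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] \ge (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(n^{2d\varepsilon - 1 - (\alpha/d)}).$$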
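For the upper bound, using the fact that $1 - x \le e^{-x}$ for $x \in (0, 1)$ we have from (28)

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] = \int_0^\infty (1 - A(\mathbf{U}_n; z^{1/\alpha}))^{n-1}\, dz$$

$$\le \int_0^{n^{\varepsilon\alpha - (\alpha/d)}} \exp(-v_d (n - 1) z^{d/\alpha})\, dz + \int_{n^{\varepsilon\alpha - (\alpha/d)}}^\infty \exp(-C (n - 1) z^{d/\alpha})\, dz. \tag{32}$$

For $\alpha \in (0, d]$, the second term on the right-hand side of (32) is $O(\exp(-C n^{\varepsilon d}))$, using (21)
with $t = (C(n - 1))^{\alpha/d} z$, $\beta = d/\alpha$, and $x = (C(n - 1))^{\alpha/d} n^{\varepsilon\alpha - (\alpha/d)}$. Also, the first term on
the right-hand side of (32) is bounded by

$$\exp(v_d n^{d\varepsilon - 1}) \int_0^{n^{\varepsilon\alpha - (\alpha/d)}} \exp(-v_d n z^{d/\alpha})\, dz = (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(n^{d\varepsilon - 1 - (\alpha/d)}),$$

by (31). So from (32), for the upper bound we obtain

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] \le (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(n^{d\varepsilon - 1 - (\alpha/d)}).$$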

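Combining the upper and lower bounds we have

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] = (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(n^{d\varepsilon - 1 - (\alpha/d)}), \tag{33}$$

for $\alpha \in (0, d]$ and $\varepsilon$ small enough. Now consider $\mathbf{x} \in (0,1)^d \setminus S_n$. Here $C r^d \le A(\mathbf{x}; r) \le v_d r^d$
for $r \le d^{1/2}$, and by similar arguments to above, we obtain

$$E[Z_n^{\alpha} \mid \mathbf{U}_n \notin S_n] = O(n^{-\alpha/d}). \tag{34}$$

Since $P(\mathbf{U}_n \notin S_n) = O(n^{\varepsilon - (1/d)})$, we obtain from (33) and (34) that for any $\varepsilon > 0$

$$E[Z_n^{\alpha}] = E[Z_n^{\alpha} \mid \mathbf{U}_n \in S_n] P(\mathbf{U}_n \in S_n) + E[Z_n^{\alpha} \mid \mathbf{U}_n \notin S_n] P(\mathbf{U}_n \notin S_n)$$

$$= (n v_d)^{-\alpha/d}\, \Gamma(1 + (\alpha/d)) + O(n^{\varepsilon - (\alpha/d) - (1/d)}),$$

and so we have (27). □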
4 Proof of Theorem 2.1

The aim of this section is to prove the upper bounds on variances for $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$ and $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_\lambda)$
given in Theorem 2.1. The following martingale-difference result is the key to the proof of the
binomial parts of Theorem 2.1. Some extra work is then needed to derive the 'Poissonized'
version of the result.

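Lemma 4.1 Let $d \in \mathbb{N}$ and $\alpha > 0$. For each $n \in \mathbb{N}$, there exist mean-zero random variables
$D_i^{(n)}$, $i = 1, 2, \ldots, n$, such that:

(i) $\sum_{i=1}^{n} D_i^{(n)} = \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$;

(ii) $E[D_i^{(n)} D_j^{(n)}] = 0$ for $i \ne j$;

(iii) there exists $C \in (0, \infty)$ such that $E[(D_i^{(n)})^2] \le C i^{-2\alpha/d}$ for all $n, i$.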
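To see why a result of this form would give the binomial variance bounds of Theorem 2.1: by (i) and (ii) the variance of the centred total weight equals $\sum_{i=1}^n E[(D_i^{(n)})^2]$, and by (iii) this is at most $C \sum_{i=1}^n i^{-2\alpha/d}$. The sketch below (with hypothetical function names) compares that sum with the rates $n^{1 - (2\alpha/d)}$ for $\alpha < d/2$ and $\log(1 + n)$ for $\alpha = d/2$; the ratio stays bounded as n grows.

```python
import math


def variance_bound_sum(n, d, alpha):
    """sum_{i=1}^n i^(-2*alpha/d): the bound implied by (i)-(iii), up to the constant C."""
    return sum(i ** (-2.0 * alpha / d) for i in range(1, n + 1))


def theorem_rate(n, d, alpha):
    """n^(1 - 2*alpha/d) for alpha < d/2, and log(1 + n) for alpha = d/2."""
    if 2 * alpha < d:
        return n ** (1.0 - 2.0 * alpha / d)
    if 2 * alpha == d:
        return math.log(1 + n)
    raise ValueError("rates are only stated for alpha in (0, d/2]")


if __name__ == "__main__":
    for alpha in (0.5, 1.0):  # d = 2: subcritical and critical exponents
        for n in (10, 1_000, 100_000):
            ratio = variance_bound_sum(n, 2, alpha) / theorem_rate(n, 2, alpha)
            print(alpha, n, round(ratio, 3))
```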

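Before proving the lemma, we introduce some more notation. For $n \in \mathbb{N}$, let $\mathcal{F}_n$ denote
the $\sigma$-field generated by $\mathcal{U}_n$. Let $\mathcal{F}_0$ denote the trivial $\sigma$-field. For ease of notation during
this proof, set $Y_n = \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$. Then we can write for $n \in \mathbb{N}$

$$Y_n = \sum_{i=1}^{n} D_i^{(n)},$$

where for $i \in \{1, 2, \ldots, n\}$

$$D_i^{(n)} = E[Y_n \mid \mathcal{F}_i] - E[Y_n \mid \mathcal{F}_{i-1}], \tag{35}$$

and for fixed $n$ the $D_i^{(n)}$, $i = 1, \ldots, n$, are martingale differences, and hence orthogonal (see
e.g. Chapter 12 of [20]). This establishes parts (i) and (ii) of the lemma. It remains to
estimate $E[(D_i^{(n)})^2]$. Given $\mathcal{U}_n = (\mathbf{U}_1, \ldots, \mathbf{U}_n)$, for $i \in \{1, \ldots, n\}$ let $\mathbf{U}_i'$ be an independent
copy of $\mathbf{U}_i$ (independent of $\mathbf{U}_1, \mathbf{U}_2, \ldots$) and set

$$\mathcal{U}_n^i := (\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_i', \mathbf{U}_{i+1}, \ldots, \mathbf{U}_n),$$

so $\mathcal{U}_n^i$ is $\mathcal{U}_n$ with the $i$th member of the sequence independently re-sampled. Define

$$\Delta_i^{(n)} := \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n^i) - \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) = \mathcal{O}^{d,\alpha}(\mathcal{U}_n^i) - \mathcal{O}^{d,\alpha}(\mathcal{U}_n),$$

the change in $Y_n$ on re-sampling the point $\mathbf{U}_i$. Then it is the case that

$$D_i^{(n)} = -E[\Delta_i^{(n)} \mid \mathcal{F}_i].$$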
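The re-sampling change just defined is easy to compute directly for a given realisation, which can help when experimenting with the size of these increments. The sketch below is illustrative only: `ong_weight` and `resampling_change` are hypothetical names, indices are 0-based, and the total weight is recomputed from scratch rather than updated incrementally. The centring constants cancel in the difference, so the uncentred weight suffices.

```python
import math


def ong_weight(points, alpha):
    """Total power-weighted ONG length: each point joins its nearest predecessor."""
    return sum(
        min(math.dist(points[i], points[k]) for k in range(i)) ** alpha
        for i in range(1, len(points))
    )


def resampling_change(points, i, new_point, alpha):
    """Change in total weight when the point at (0-based) index i is replaced by new_point."""
    resampled = list(points)
    resampled[i] = new_point
    return ong_weight(resampled, alpha) - ong_weight(points, alpha)


if __name__ == "__main__":
    pts = [(0.1,), (0.9,), (0.4,)]
    # Replacing the second point changes both its own edge and the edge
    # from the third point.
    print(resampling_change(pts, 1, (0.5,), alpha=1.0))
```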

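We split $\Delta_i^{(n)}$ into six components as follows. Let $\Delta_{i,1}^{(n)}$ be the weight of the edge from $\mathbf{U}_i$ in
the ONG on $\mathcal{U}_n$, and let $\Delta_{i,2}^{(n)}$ be the weight of the edge from $\mathbf{U}_i'$ in the ONG on $\mathcal{U}_n^i$. Let $\Delta_{i,3}^{(n)}$
be the total weight of the edges incident to $\mathbf{U}_i$ in the ONG on $\mathcal{U}_n$, and let $\Delta_{i,4}^{(n)}$ be the total
weight of the edges incident to $\mathbf{U}_i'$ in the ONG on $\mathcal{U}_n^i$. Let $\Delta_{i,5}^{(n)}$ be the total weight of edges in
the ONG on $(\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_{i+1}, \ldots, \mathbf{U}_n)$ from points in $\mathcal{U}_n$ that are joined to $\mathbf{U}_i'$ in the ONG
on $\mathcal{U}_n^i$. Let $\Delta_{i,6}^{(n)}$ be the total weight of edges in the ONG on $(\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_{i+1}, \ldots, \mathbf{U}_n)$
from points in $\mathcal{U}_n$ that are joined to $\mathbf{U}_i$ in the ONG on $\mathcal{U}_n$. Then

$$\Delta_i^{(n)} = \Delta_{i,2}^{(n)} + \Delta_{i,4}^{(n)} + \Delta_{i,6}^{(n)} - \Delta_{i,1}^{(n)} - \Delta_{i,3}^{(n)} - \Delta_{i,5}^{(n)}.$$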

The next result will be crucial for the proof of Lemma 4.1.

Lemma 4.2 For any $\alpha > 0$ there exists $C \in (0, \infty)$ such that for all $\ell \in \{1, \ldots, 6\}$

$$E[(E[\Delta_{i,\ell}^{(n)} \mid \mathcal{F}_i])^2] \le C i^{-2\alpha/d}, \tag{36}$$

for all $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$.

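Proof. First consider $\ell \in \{1, 2\}$. By the conditional Jensen's inequality,

$$E[(E[\Delta_{i,\ell}^{(n)} \mid \mathcal{F}_i])^2] \le E[E[(\Delta_{i,\ell}^{(n)})^2 \mid \mathcal{F}_i]] = E[(\Delta_{i,\ell}^{(n)})^2].$$

For $\ell \in \{1, 2\}$, we have from Lemma 3.1 in [16] (cf. (27) above) that for $\alpha > 0$, $E[(\Delta_{i,\ell}^{(n)})^2] =
E[Z_i^{2\alpha}] \le C i^{-2\alpha/d}$ for all $i, n$. Thus for $\ell \in \{1, 2\}$ there is a constant $C \in (0, \infty)$ such that
(36) holds for all $i$ and $n$.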

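Now consider $\ell \in \{3, 4\}$. For $i \in \mathbb{N}$, let $\mathcal{V}_i := \mathcal{V}_{i-1}(\mathbf{U}_i)$ be the Voronoi cell of $\mathbf{U}_i$ with
respect to $\{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_i\}$. Similarly, let $\mathcal{V}_i' := \mathcal{V}_{i-1}(\mathbf{U}_i')$ be the Voronoi cell of $\mathbf{U}_i'$ with
respect to $\{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_i'\}$.

By convexity, there exists a $d$-cube of side length at most $2\,\mathrm{diam}(\mathcal{V}_i)$ which contains $\mathcal{V}_i$
and also lies inside $(0,1)^d$. Let $B_i$ denote a minimal-volume such cube.

Points of $\{\mathbf{U}_{i+1}, \ldots, \mathbf{U}_n\}$ that fall outside of $\mathcal{V}_i$ can never be joined to $\mathbf{U}_i$ and can only
serve to decrease the total weight incident to $\mathbf{U}_i$ (by shrinking the subsequent Voronoi cells).
Hence removing any point of $\{\mathbf{U}_{i+1}, \ldots, \mathbf{U}_n\}$ that falls outside $\mathcal{V}_i$ (and in particular any
that falls outside $B_i$) can only increase the total weight of edges incident to $\mathbf{U}_i$. Moreover,
$\{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}$ necessarily lie outside $\mathcal{V}_i$ and their removal can only increase the total weight
incident to $\mathbf{U}_i$. In other words, for any $j \ge i + 1$ and any subsequence $\mathcal{U}_j'$ of $\mathcal{U}_j$ containing $\mathbf{U}_i$
and $\mathbf{U}_j$, we have $E(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j) \subseteq E(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j')$, and $P(E(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j)) = 0$ for any $\mathbf{U}_j \notin \mathcal{V}_i$
and in particular any $\mathbf{U}_j \notin B_i$.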
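It follows that

$$\Delta_{i,3}^{(n)} = \sum_{j=i+1}^{n} \mathbf{1}_{E(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j)} \|\mathbf{U}_i - \mathbf{U}_j\|^{\alpha} \le \sum_{j : i+1 \le j \le n,\ \mathbf{U}_j \in B_i} \mathbf{1}_{E(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_{j,i})} \|\mathbf{U}_i - \mathbf{U}_j\|^{\alpha},$$

where $\mathcal{U}_{j,i}$ is the subsequence of $(\mathbf{U}_i, \ldots, \mathbf{U}_j)$ consisting only of those points in $B_i$. So in
particular, given $\mathcal{F}_i$, $\Delta_{i,3}^{(n)}$ is stochastically dominated by $\mathcal{O}_{\mathbf{U}_i}^{d,\alpha}(B_i; N)$ where $N \sim \mathrm{Bin}(n - i, |B_i|)$
is the number of points of $\{\mathbf{U}_{i+1}, \ldots, \mathbf{U}_n\}$ that fall in $B_i$. (Recall the definition of
$\mathcal{O}_{\mathbf{x}}^{d,\alpha}(D; n)$ from (13).) We thus have that, given $\mathcal{F}_i$, $\Delta_{i,3}^{(n)}$ is stochastically dominated by

$$\mathcal{O}_{\mathbf{U}_i}^{d,\alpha}(B_i; n) = |B_i|^{\alpha/d}\, \mathcal{O}_{\mathbf{x}}^{d,\alpha}(n)$$

by scaling, for some $\mathbf{x} \in (0,1)^d$. Since $|B_i| \le C (\mathrm{diam}(\mathcal{V}_i))^d$, we have in particular that for
all $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$

$$E[\Delta_{i,3}^{(n)} \mid \mathcal{F}_i] \le C (\mathrm{diam}(\mathcal{V}_i))^{\alpha} \sup_{\mathbf{x} \in (0,1)^d} E[\mathcal{O}_{\mathbf{x}}^{d,\alpha}(n)] \le C (\mathrm{diam}(\mathcal{V}_i))^{\alpha},$$

by (14). Thus by Lemma 3.1, for all $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$,

$$E[(E[\Delta_{i,3}^{(n)} \mid \mathcal{F}_i])^2] \le C E[(\mathrm{diam}(\mathcal{V}_i))^{2\alpha}] \le C i^{-2\alpha/d}. \tag{37}$$

Similarly, $E[\Delta_{i,4}^{(n)} \mid \mathcal{F}_i] \le C E[(\mathrm{diam}(\mathcal{V}_i'))^{\alpha} \mid \mathcal{F}_i]$ so that, by the conditional Jensen's inequality,

$$E[(E[\Delta_{i,4}^{(n)} \mid \mathcal{F}_i])^2] \le C E[(\mathrm{diam}(\mathcal{V}_i'))^{2\alpha}] \le C i^{-2\alpha/d}, \tag{38}$$

for all $n \in \mathbb{N}$ and $i \in \{1, \ldots, n\}$ by Lemma 3.1 once more, since $\mathcal{V}_i' \stackrel{d}{=} \mathcal{V}_i$. Thus from (37)
and (38) we verify the $\ell \in \{3, 4\}$ cases of (36).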

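Finally consider $\ell \in \{5, 6\}$. Recall that $\mathcal{U}_{j,i}$ is the subsequence of $(\mathbf{U}_i, \ldots, \mathbf{U}_j)$ consisting only of those points in $B_i$. By the argument above for $\Delta_{i,3}^{(n)}$, we have that
\[ \Delta_{i,5}^{(n)} = \sum_{j=i+1}^{n} \mathbf{1}_{\mathcal{E}(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j)} \big(d(\mathbf{U}_j; \{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\})\big)^{\alpha} \]
\[ \le \sum_{j :\, i+1 \le j \le n,\ \mathbf{U}_j \in B_i} \mathbf{1}_{\mathcal{E}(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_{j,i})} \big(d(\mathbf{U}_j; \{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\})\big)^{\alpha}. \tag{39} \]

List the points of $\mathcal{U}_{n,i}$ in order of increasing mark (index) as $(\mathbf{U}_i, \mathbf{U}_{j_1}, \mathbf{U}_{j_2}, \ldots)$. For $j \ge j_1 + 1$, observe that removing points outside $B_i$ can only increase the distance from $\mathbf{U}_j$ to its nearest neighbour amongst $\{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\}$, since we know $\mathbf{U}_{j_1} \in B_i$. Thus we have that, for such $j$,
\[ d(\mathbf{U}_j; \{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\}) \le d(\mathbf{U}_j; (\{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\}) \cap B_i). \tag{40} \]
Then from (39) and (40) we obtain
\[ \Delta_{i,5}^{(n)} \le \big(d(\mathbf{U}_{j_1}; \{\mathbf{U}_1, \ldots, \mathbf{U}_{j_1 - 1}\} \setminus \{\mathbf{U}_i\})\big)^{\alpha} + \sum_{j :\, i+1 \le j \le n,\ \mathbf{U}_j \in B_i} \mathbf{1}_{\mathcal{E}(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_{j,i})} \big(d(\mathbf{U}_j; (\{\mathbf{U}_1, \ldots, \mathbf{U}_{j-1}\} \setminus \{\mathbf{U}_i\}) \cap B_i)\big)^{\alpha}. \tag{41} \]

To bound the length of the edge from $\mathbf{U}_{j_1}$, we note that any point $\mathbf{y} \in \mathcal{V}_n(\mathbf{x})$ has $d(\mathbf{y}; \{\mathbf{U}_1, \ldots, \mathbf{U}_n\}) \le 2\,\mathrm{diam}(\mathcal{V}_n(\mathbf{x}))$. Hence
\[ d(\mathbf{U}_{j_1}; \{\mathbf{U}_1, \ldots, \mathbf{U}_{j_1 - 1}\} \setminus \{\mathbf{U}_i\}) \le C\,\mathrm{diam}(V_i). \tag{42} \]

Recall the definition of $\tilde{\mathcal{O}}^{d,\alpha}_{\mathbf{x}}(D; n)$ from just above Lemma 3.4. Then from (41) with (42), we have that, given $\mathcal{F}_i$, $\Delta_{i,5}^{(n)}$ is stochastically dominated by
\[ \tilde{\mathcal{O}}^{d,\alpha}_{\mathbf{x}}(B_i; n) + C(\mathrm{diam}(V_i))^{\alpha} = |B_i|^{\alpha/d}\, \tilde{\mathcal{O}}^{d,\alpha}_{\mathbf{x}}(n) + C(\mathrm{diam}(V_i))^{\alpha}, \]
for some $\mathbf{x} \in (0,1)^d$. Taking expectations, we obtain from Lemma 3.4 that
\[ E[\Delta_{i,5}^{(n)} \mid \mathcal{F}_i] \le C|B_i|^{\alpha/d} + C(\mathrm{diam}(V_i))^{\alpha} \le C'(\mathrm{diam}(V_i))^{\alpha}. \]
Then by Lemma 3.1 we obtain
\[ E\big[(E[\Delta_{i,5}^{(n)} \mid \mathcal{F}_i])^2\big] \le C E[(\mathrm{diam}(V_i))^{2\alpha}] \le C' i^{-2\alpha/d}, \]
for all $n, i$. A similar argument holds for $\Delta_{i,6}^{(n)}$, and thus verifies the $\ell \in \{5, 6\}$ cases of (36). This completes the proof of the lemma. □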

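Proof of Lemma 4.1. With $D_i^{(n)}$ given by (35), parts (i) and (ii) of the lemma are immediate, as described above. The Cauchy-Schwarz inequality and (36) imply
\[ E[(D_i^{(n)})^2] = E\big[(E[\Delta_i^{(n)} \mid \mathcal{F}_i])^2\big] = E\Big[\Big(\sum_{\ell=1}^{6} (-1)^{\ell} E[\Delta_{i,\ell}^{(n)} \mid \mathcal{F}_i]\Big)^2\Big] \le C i^{-2\alpha/d}, \]
for all $n, i$. This yields part (iii) of the lemma. □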

To deduce the Poisson version of Theorem 2.1, and later Theorem 2.2, we prove the following series of lemmas.

Lemma 4.3 Let $N(n)$ be a Poisson random variable with mean $n \ge 1$. For $\beta \in [0, 1)$,
\[ \mathrm{Var}[N(n)^{1-\beta}] \le C n^{1-2\beta}; \tag{43} \]
\[ E[(N(n)^{1-\beta} - n^{1-\beta})^2] \le C n^{1-2\beta}; \tag{44} \]
\[ \text{and } E[(\log(1 + N(n)) - \log(1 + n))^2] \le C n^{-1}; \tag{45} \]
for some $C \in (0, \infty)$ and all $n \ge 1$.
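The bounds (44) and (45) can be checked numerically for particular values of $n$ and $\beta$. The following Python sketch is an illustration only and is not part of the argument: it evaluates the two expectations by summing the Poisson probability mass function directly, and reports the ratios of the left-hand sides of (44) and (45) to $n^{1-2\beta}$ and $n^{-1}$ respectively. The function names and the truncation point of the series are choices made here for illustration.

```python
import math

def poisson_pmf(k: int, mean: float) -> float:
    """P(N = k) for N ~ Poisson(mean), computed in log space for stability."""
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def poisson_expectation(f, mean: float) -> float:
    """E[f(N)] for N ~ Poisson(mean); the series is truncated far in the upper tail."""
    upper = int(mean + 20.0 * math.sqrt(mean) + 50)
    return sum(f(k) * poisson_pmf(k, mean) for k in range(upper + 1))

def power_ratio(n: float, beta: float) -> float:
    """E[(N^{1-beta} - n^{1-beta})^2] divided by n^{1-2*beta}, cf. (44)."""
    second_moment = poisson_expectation(
        lambda k: (k ** (1.0 - beta) - n ** (1.0 - beta)) ** 2, n)
    return second_moment / n ** (1.0 - 2.0 * beta)

def log_ratio(n: float) -> float:
    """n * E[(log(1+N) - log(1+n))^2], cf. (45)."""
    second_moment = poisson_expectation(
        lambda k: (math.log1p(k) - math.log1p(n)) ** 2, n)
    return n * second_moment

# Print the two ratios for a few values of n, with beta = 1/2.
for n in (10, 100, 1000):
    print(n, round(power_ratio(n, 0.5), 4), round(log_ratio(n), 4))
```

If the lemma holds, both printed ratios should stay bounded as $n$ grows; a first-order expansion suggests they are close to $(1-\beta)^2$ and $1$ respectively for large $n$.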

Proof. Let $n \ge 1$. First we prove (43), (44). Let $\beta \in [0, 1)$. Set $K_n := N(n) - n$. Then
\[ N(n)^{1-\beta} = n^{1-\beta}(1 + n^{-1}K_n)^{1-\beta}, \tag{46} \]
where by the Mean Value Theorem we have that $(1 + n^{-1}K_n)^{1-\beta} = 1 + (1 - \beta)(1 + H_n)^{-\beta} n^{-1} K_n$ for some $H_n$ with $|H_n| \le n^{-1}|K_n|$. Hence
\[ N(n)^{1-\beta} - n^{1-\beta} = n^{-\beta}(1 - \beta)(1 + H_n)^{-\beta} K_n, \tag{47} \]
so that for $C \in (0, \infty)$
\[ E[(N(n)^{1-\beta} - n^{1-\beta})^2] = C n^{-2\beta} E[(1 + H_n)^{-2\beta} K_n^2]. \tag{48} \]
Let $A_n$ denote the event $\{|K_n| \le n^{3/4}\}$. Then
\[ E[(1 + H_n)^{-2\beta} K_n^2] = E[(1 + H_n)^{-2\beta} K_n^2 \mathbf{1}_{A_n}] + E[(1 + H_n)^{-2\beta} K_n^2 \mathbf{1}_{A_n^c}]. \]
Here, by Cauchy-Schwarz,

\[ E[(1 + H_n)^{-2\beta} K_n^2 \mathbf{1}_{A_n^c}] \le \big(E[(1 + H_n)^{-4\beta} K_n^4]\, P(A_n^c)\big)^{1/2}. \]

But by (47), for $C \in (0, \infty)$, $E[(1 + H_n)^{-4\beta} K_n^4] = C n^{4\beta} E[|N(n)^{1-\beta} - n^{1-\beta}|^4]$, so that
\[ E[(1 + H_n)^{-2\beta} K_n^2 \mathbf{1}_{A_n^c}] \le C n^{2\beta} \big(E[|N(n)^{1-\beta} - n^{1-\beta}|^4]\big)^{1/2} (P(A_n^c))^{1/2}, \tag{49} \]
which tends to zero as $n \to \infty$, by standard Chernoff-type Poisson tail bounds (see e.g. Lemma 1.2 in [12]). Also, given $A_n$, $|H_n| \le n^{-1}|K_n| \le n^{-1/4}$, so that
\[ E[(1 + H_n)^{-2\beta} K_n^2 \mathbf{1}_{A_n}] \le C E[K_n^2 \mid A_n] \le C E[K_n^2]\, P(A_n)^{-1} = C n P(A_n)^{-1} \sim C n, \tag{50} \]
as $n \to \infty$, by standard Poisson tail bounds. So from (48), (49) and (50) we obtain (44). Now from (47) we have
\[ \mathrm{Var}[N(n)^{1-\beta}] = C n^{-2\beta}\, \mathrm{Var}[(1 + H_n)^{-\beta} K_n] \le C n^{-2\beta} E[(1 + H_n)^{-2\beta} K_n^2]. \]
Then from (49) and (50) we obtain (43).

Finally, the Mean Value Theorem implies that
\[ \log(1 + N(n)) - \log(1 + n) = \log(1 + (1 + n)^{-1} K_n) = (1 + n)^{-1}(1 + H_n)^{-1} K_n, \]
where, as before, $|H_n| \le n^{-1}|K_n|$. Hence
\[ E[(\log(1 + N(n)) - \log(1 + n))^2] = (1 + n)^{-2} E[(1 + H_n)^{-2} K_n^2]. \]
Now (50) still holds with $\beta = 1$, while instead of (49) in this case we have
\[ E[(1 + H_n)^{-2} K_n^2 \mathbf{1}_{A_n^c}] \le (1 + n)^2 \big(E[(\log(1 + N(n)) - \log(1 + n))^4]\big)^{1/2} (P(A_n^c))^{1/2}, \]
by Cauchy-Schwarz, which again tends to zero as $n \to \infty$. Thus we obtain (45). □

Lemma 4.4 Let $d \in \mathbb{N}$ and $\alpha > 0$. Let $N(n)$ be a Poisson random variable with mean $n \ge 1$. Then there exists $C \in (0, \infty)$ such that for all $n \ge 1$
\[ E\big[\mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_{N(n)}) \mid N(n)]\big] \le C + \sup_{1 \le m \le 2n} \mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_m)]. \tag{51} \]

Proof. We have that
\[ E\big[\mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_{N(n)}) \mid N(n)]\big] \le \sup_{1 \le m \le 2n} \mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_m)] + C E[(N(n))^2 \mathbf{1}_{\{N(n) > 2n\}}], \]
using the trivial bound that $\mathcal{O}^{d,\alpha}(\mathcal{U}_{N(n)}) \le C N(n)$. By Cauchy-Schwarz, the last term in the above display is bounded by a constant times
\[ (E[(N(n))^4])^{1/2} (P(N(n) > 2n))^{1/2}, \]
which tends to 0 as $n \to \infty$ by standard Poisson tail bounds. So we obtain (51). □
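The decay of the remainder term $E[(N(n))^2 \mathbf{1}_{\{N(n) > 2n\}}]$ used in this proof can be observed directly. The Python sketch below is an illustration under the stated truncation and not part of the proof: it sums $k^2 P(N(n) = k)$ over $k > 2n$ for a Poisson variable with mean $n$. The function names and the upper summation limit are assumptions made here.

```python
import math

def poisson_pmf(k: int, mean: float) -> float:
    """P(N = k) for N ~ Poisson(mean), computed in log space for stability."""
    return math.exp(k * math.log(mean) - mean - math.lgamma(k + 1))

def truncated_second_moment(n: float) -> float:
    """E[N^2 1{N > 2n}] for N ~ Poisson(n); the series is cut off far in the tail."""
    start = math.floor(2 * n) + 1
    upper = int(2 * n + 40.0 * math.sqrt(n) + 100)
    return sum(k * k * poisson_pmf(k, n) for k in range(start, upper + 1))

# The remainder shrinks rapidly as the mean grows.
for n in (5, 10, 20, 50):
    print(n, truncated_second_moment(n))
```

The rapid decay reflects the exponential Poisson tail bound, which dominates the polynomial factor $k^2$.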
Lemma 4.5 Let $d \in \mathbb{N}$ and $\alpha \in (0, d]$. For $n \in \mathbb{N}$, let $\mu_n := E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)]$. Let $N(n)$ be a Poisson random variable with mean $n$. There exists $C \in (0, \infty)$ such that for all $n \ge 1$
\[ E[(\mu_{N(n)} - \mu_{\lfloor n \rfloor})^2] \le C n^{1-(2\alpha/d)}. \]

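Proof. Taking expectations in (25), we have that for $n \in \mathbb{N}$
\[ \mu_n = \sum_{i=1}^{n} E[Z_i^{\alpha}]. \]
First suppose that $\alpha \in (0, d)$. By (27) we have that, for integers $\ell, m$ with $1 \le \ell \le m$,
\[ \mu_m - \mu_\ell = \sum_{i=\ell+1}^{m} E[Z_i^{\alpha}] = C\big(m^{1-(\alpha/d)} - \ell^{1-(\alpha/d)}\big) + \sum_{i=\ell+1}^{m} h(i) + O(m^{-\alpha/d}) + O(\ell^{-\alpha/d}). \tag{52} \]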

In particular, for $n \ge 1$
\[ |\mu_{N(n)} - \mu_{\lfloor n \rfloor}| = C|N(n)^{1-(\alpha/d)} - n^{1-(\alpha/d)}| + \delta(n), \tag{53} \]
where from (52) the random variable $\delta(n)$ satisfies
\[ |\delta(n)| \le \sum_{i=\min\{N(n), \lfloor n \rfloor\}}^{\max\{N(n), \lfloor n \rfloor\}} |h(i)| + O\big(\min\{n, 1 + N(n)\}^{-\alpha/d}\big). \tag{54} \]

On the other hand, for $\alpha = d$, this time (27) implies that for $1 \le \ell \le m$,
\[ \mu_m - \mu_\ell = v_d^{-1}(\log(1 + m) - \log(1 + \ell)) + \sum_{i=\ell+1}^{m} h(i) + O(m^{-1}) + O(\ell^{-1}). \tag{55} \]

In particular, for $n \ge 1$, (55) gives
\[ |\mu_{N(n)} - \mu_{\lfloor n \rfloor}| = C|\log(1 + N(n)) - \log(1 + n)| + \delta(n), \tag{56} \]
where again $\delta(n)$ satisfies (54), now with $\alpha = d$.

We now claim that for all $\alpha \in (0, d]$, $\delta(n)$ as defined by (53) or (56) satisfies
\[ E[\delta(n)^2] = o(n^{1-(2\alpha/d)}), \quad \text{as } n \to \infty. \tag{57} \]
Then in the case $\alpha \in (0, d)$, (57) with (53), (44) and Cauchy-Schwarz yields the lemma. In the case $\alpha = d$, the result follows from (57) with (56), (45) and Cauchy-Schwarz again.

It remains to prove the claim (57). We start from the fact that for $\alpha \in (0, d]$, $\delta(n)$ satisfies (54). Note that there exists $C \in (0, \infty)$ such that for $n \ge 1$
\[ E[\min\{n, 1 + N(n)\}^{-2\alpha/d}] \le n^{-2\alpha/d} + E[(1 + N(n))^{-2\alpha/d}] \le C n^{-2\alpha/d}, \tag{58} \]
as can be proved by standard Poisson tail estimates as used elsewhere in the present paper (cf Lemma 5.1 for an analogous binomial result).

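Now we deal with the main term in (54). We have that for $n \ge 1$, $\alpha \in [0, d]$ and any $\eta \in (0, 1/2)$,
\[ \sup_{m \in \mathbb{N} :\, |m - \lfloor n \rfloor| \le n^{(1/2)+\eta}} \sum_{i=\min\{m, \lfloor n \rfloor\}}^{\max\{m, \lfloor n \rfloor\}} |h(i)| \le (2n^{(1/2)+\eta} + 1) \sup_{m \in \mathbb{N} :\, |m - \lfloor n \rfloor| \le n^{(1/2)+\eta}} |h(m)|; \tag{59} \]
it follows from (59) and Lemma 3.6 that for any $\eta \in (0, 1/2)$, $\varepsilon > 0$, there exists $C \in (0, \infty)$ such that for all $n \ge 1$
\[ \sup_{m \in \mathbb{N} :\, |m - \lfloor n \rfloor| \le n^{(1/2)+\eta}} \sum_{i=\min\{m, \lfloor n \rfloor\}}^{\max\{m, \lfloor n \rfloor\}} |h(i)| \le C n^{(1/2)+\eta-(1/d)-(\alpha/d)+\varepsilon}. \tag{60} \]

In particular, this is $o(n^{(1/2)-(\alpha/d)})$ for sufficiently small $\varepsilon, \eta$. Now we have, with $\eta > 0$ as above, for $n \ge 1$
\[ E\Big[\Big(\sum_{i=\min\{N(n), \lfloor n \rfloor\}}^{\max\{N(n), \lfloor n \rfloor\}} |h(i)|\Big)^2\Big] \le \Big(\sup_{m \in \mathbb{N} :\, |m - \lfloor n \rfloor| \le n^{(1/2)+\eta}} \sum_{i=\min\{m, \lfloor n \rfloor\}}^{\max\{m, \lfloor n \rfloor\}} |h(i)|\Big)^2 + C E\big[|N(n) - \lfloor n \rfloor|^2 \mathbf{1}_{\{|N(n) - \lfloor n \rfloor| > n^{(1/2)+\eta}\}}\big], \tag{61} \]
and by Cauchy-Schwarz
\[ E\big[|N(n) - \lfloor n \rfloor|^2 \mathbf{1}_{\{|N(n) - \lfloor n \rfloor| > n^{(1/2)+\eta}\}}\big] \le \big(E[|N(n) - \lfloor n \rfloor|^4]\, P(|N(n) - \lfloor n \rfloor| > n^{(1/2)+\eta})\big)^{1/2}, \]
which is $o(n^{1-(2\alpha/d)})$ as $n \to \infty$, by standard Poisson tail bounds. Thus from (61), (60), (58), and Cauchy-Schwarz, we verify (57). □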

Proof of Theorem 2.1. First we prove the binomial parts of (4) and (5). By part (i) of Lemma 4.1 we have that $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) = \sum_{i=1}^{n} D_i^{(n)}$ for each $n \in \mathbb{N}$. By the orthogonality of the $D_i^{(n)}$ (part (ii) of Lemma 4.1) we have that for $n \in \mathbb{N}$
\[ \mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)] = \sum_{i=1}^{n} E[(D_i^{(n)})^2], \]
which by part (iii) of Lemma 4.1 yields the upper bounds as claimed.
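For readers who wish to experiment with these variance bounds, the following Python sketch computes the total power-weighted edge length of the on-line nearest-neighbour graph on a given sequence of points, together with a crude Monte Carlo estimate of its variance for uniform random points in $(0,1)^d$. It is an illustrative, quadratic-time sketch only: the function names, the seeded generator and the sample sizes are assumptions made here, and nothing in the proof depends on it.

```python
import math
import random

def ong_total_weight(points, alpha):
    """Total power-weighted edge length of the on-line nearest-neighbour graph.

    Each point after the first is joined to its nearest predecessor in the
    sequence (Euclidean distance); an edge of length r contributes r**alpha.
    """
    total = 0.0
    for i in range(1, len(points)):
        nearest = min(math.dist(points[i], points[j]) for j in range(i))
        total += nearest ** alpha
    return total

def empirical_variance(n, d, alpha, trials, seed=0):
    """Sample variance of the ONG weight over independent uniform samples in (0,1)^d."""
    rng = random.Random(seed)
    samples = []
    for _ in range(trials):
        pts = [tuple(rng.random() for _ in range(d)) for _ in range(n)]
        samples.append(ong_total_weight(pts, alpha))
    mean = sum(samples) / trials
    return sum((s - mean) ** 2 for s in samples) / (trials - 1)

# Small deterministic example in d = 1: 0.5 joins 0.1, then 0.4 joins 0.5.
print(ong_total_weight([(0.1,), (0.5,), (0.4,)], 1.0))
```

Comparing `empirical_variance(n, d, alpha, trials)` across several values of `n` gives a rough impression of the growth rate, although Monte Carlo error makes this only indicative.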

We now deduce the Poisson parts of (4) and (5). For ease of notation, let $X_n := \mathcal{O}^{d,\alpha}(\mathcal{U}_n)$ and $\mu_n := E[X_n]$. Then if $N(n)$ is Poisson with mean $n \ge 1$, $\mathcal{O}^{d,\alpha}(\mathcal{P}_n)$ has the distribution of $X_{N(n)}$ and its expectation is $E[\mu_{N(n)}] =: a_n$. Write
\[ \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_n) = X_{N(n)} - a_n = (X_{N(n)} - \mu_{N(n)}) + (\mu_{\lfloor n \rfloor} - a_n) + (\mu_{N(n)} - \mu_{\lfloor n \rfloor}). \tag{62} \]

Then $\mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{P}_n)] = \mathrm{Var}[(X_{N(n)} - \mu_{N(n)}) + (\mu_{N(n)} - \mu_{\lfloor n \rfloor})]$. We have
\[ \mathrm{Var}[X_{N(n)} - \mu_{N(n)}] = E\big[\mathrm{Var}[X_{N(n)} - \mu_{N(n)} \mid N(n)]\big] = E\big[\mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_{N(n)}) \mid N(n)]\big]. \]
By (51) this is bounded by a constant times $\sup_{m \le 2n} \mathrm{Var}[\mathcal{O}^{d,\alpha}(\mathcal{U}_m)]$, which, using the binomial parts of (4) and (5), is bounded by a constant times $n^{1-(2\alpha/d)}$ for $\alpha \in (0, d/2)$ and by a constant times $\log(1 + n)$ for $\alpha = d/2$. So we have for $C \in (0, \infty)$ and $n \ge 1$
\[ \mathrm{Var}[X_{N(n)} - \mu_{N(n)}] \le \begin{cases} C n^{1-(2\alpha/d)} & \text{if } \alpha \in (0, d/2); \\ C \log(1 + n) & \text{if } \alpha = d/2. \end{cases} \tag{63} \]

The final term on the right-hand side of (62) satisfies Lemma 4.5. So by (62) with Lemma 4.5, (63), and Cauchy-Schwarz, we obtain the Poisson parts of (4) and (5). □



5 Proof of Theorem 2.2



By Lemma 4.1 we have that for $\alpha > d/2$, for all $n \in \mathbb{N}$
\[ \mathrm{Var}[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)] = \sum_{i=1}^{n} E[(D_i^{(n)})^2] \le C \sum_{i=1}^{n} i^{-2\alpha/d} \le C < \infty. \]
In order to show that $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$ in fact converges, we employ a refinement of the martingale difference technique of Section 4. First we need two more lemmas.

Lemma 5.1 Suppose $X \sim \mathrm{Bin}(n, p)$ for $n \in \mathbb{N}$ and $p \in (0, 1)$. Then for any $\beta > 0$ there exists $C \in (0, \infty)$ such that for all $n \in \mathbb{N}$ and all $p \in (0, 1)$
\[ E[(1 + X)^{-\beta}] \le C(np)^{-\beta}. \]

Proof. We have that
\[ E[(1 + X)^{-\beta}] \le (1 + (np/2))^{-\beta} + E[(1 + X)^{-\beta} \mathbf{1}_{\{X < np/2\}}] \le C(np)^{-\beta} + (E[(1 + X)^{-2\beta}])^{1/2} (P(X < np/2))^{1/2}, \]
for some $C \in (0, \infty)$ and all $n \in \mathbb{N}$, $p \in (0, 1)$, using Cauchy-Schwarz. But for $\beta > 0$, $(1 + X)^{-2\beta} \le 1$ a.s., so $E[(1 + X)^{-2\beta}] \le 1$. Also, by standard binomial tail bounds (see e.g. Lemma 1.1 in [12]), $P(X < np/2) = O(\exp(-Cnp))$ for all $n, p$. □
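The negative-moment bound of Lemma 5.1 can be checked exactly for moderate $n$ by summing over the binomial probability mass function. The Python sketch below is illustrative only; the function names are choices made here. For $\beta = 1$ it can be compared with the classical closed form $E[(1 + X)^{-1}] = (1 - (1-p)^{n+1})/((n+1)p)$, which in particular gives $E[(1+X)^{-1}] \le (np)^{-1}$.

```python
import math

def binomial_pmf(k: int, n: int, p: float) -> float:
    """P(X = k) for X ~ Bin(n, p) with 0 < p < 1, computed in log space."""
    log_pmf = (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
               + k * math.log(p) + (n - k) * math.log1p(-p))
    return math.exp(log_pmf)

def negative_moment(n: int, p: float, beta: float) -> float:
    """E[(1 + X)^(-beta)] for X ~ Bin(n, p), by direct summation."""
    return sum((1 + k) ** (-beta) * binomial_pmf(k, n, p) for k in range(n + 1))

def moment_ratio(n: int, p: float, beta: float) -> float:
    """E[(1 + X)^(-beta)] divided by (n p)^(-beta); Lemma 5.1 says this is bounded."""
    return negative_moment(n, p, beta) * (n * p) ** beta

# Print the ratio for beta = 1/2 over a range of n and p.
for n in (10, 100, 1000):
    for p in (0.01, 0.1, 0.5):
        print(n, p, round(moment_ratio(n, p, 0.5), 4))
```

The printed ratios remain bounded even when $np$ is small, which is the regime where the indicator term in the proof matters.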

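Lemma 5.2 Suppose $d \in \mathbb{N}$ and $\alpha > d/2$. For $\varepsilon > 0$ sufficiently small, we have that
\[ \lim_{n \to \infty} \sup_{m :\, |n - m| \le n^{(1/2)+\varepsilon}} \big|E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)] - E[\mathcal{O}^{d,\alpha}(\mathcal{U}_m)]\big| = 0. \tag{64} \]

Proof. For ease of notation, let $\mu_n := E[\mathcal{O}^{d,\alpha}(\mathcal{U}_n)]$. By monotonicity of $\mu_n$,
\[ \sup_{m :\, |n - m| \le n^{(1/2)+\varepsilon}} |\mu_n - \mu_m| \le \max\{\mu_{n + \lceil n^{(1/2)+\varepsilon} \rceil} - \mu_n,\ \mu_n - \mu_{n - \lceil n^{(1/2)+\varepsilon} \rceil}\}, \]
so it suffices to show that both terms in the maximum tend to zero as $n \to \infty$. Consider the $\alpha \in (d/2, d)$ case of (52). Now by Lemma 3.6 we have, for small enough $\varepsilon > 0$,
\[ \sum_{i=n+1}^{n + \lceil n^{(1/2)+\varepsilon} \rceil} h(i) \le C n^{(1/2)+\varepsilon} \sup_{i :\, n \le i \le n + \lceil n^{(1/2)+\varepsilon} \rceil} h(i) \le C n^{(1/2)-(1/d)-(\alpha/d)+2\varepsilon} = O(n^{(1/2)-(\alpha/d)}), \]
which tends to 0 as $n \to \infty$, given that $\alpha > d/2$. Thus by (52), as $n \to \infty$,
\[ \mu_{n + \lceil n^{(1/2)+\varepsilon} \rceil} - \mu_n = C n^{1-(\alpha/d)}\big((1 + n^{\varepsilon-(1/2)})^{1-(\alpha/d)} - 1\big) + o(1), \]
for some $C \in (0, \infty)$. But this is $O(n^{(1/2)-(\alpha/d)+\varepsilon})$, which tends to zero for $\alpha > d/2$ and $\varepsilon$ small enough. Similarly for $\mu_n - \mu_{n - \lceil n^{(1/2)+\varepsilon} \rceil}$. Thus we obtain (64) for $\alpha \in (d/2, d)$.

Now suppose that $\alpha = d$. This time we have (55): by Lemma 3.6 the sum in (55) tends to 0 as $m, \ell \to \infty$. Thus for $\varepsilon > 0$ small enough
\[ \mu_{n + \lceil n^{(1/2)+\varepsilon} \rceil} - \mu_n = v_d^{-1} \log\Big(\frac{1 + n + \lceil n^{(1/2)+\varepsilon} \rceil}{1 + n}\Big) + o(1) = O(n^{\varepsilon-(1/2)}) + o(1) \to 0, \]
and similarly for $\mu_n - \mu_{n - \lceil n^{(1/2)+\varepsilon} \rceil}$. Thus we get (64) for $\alpha = d$. The case $\alpha > d$ is straightforward, since there (see Proposition 2.1) $\mu_n \to \mu(d, \alpha) \in (0, \infty)$ as $n \to \infty$. □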

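To prepare for the proof of Theorem 2.2, we modify the technique used in the proof of Lemma 4.1 above. For $n, m \in \mathbb{N}$ with $m < n$ set $Y_{n,m} := \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) - \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_m)$, i.e. $Y_{n,m}$ is the centred total weight of edges in the ONG on $\mathcal{U}_n$ counting only edges from points after the first $m$ in the sequence. With $\mathcal{F}_i$ the $\sigma$-field generated by $(\mathbf{U}_1, \ldots, \mathbf{U}_i)$, set
\[ D_i^{(n,m)} := E[Y_{n,m} \mid \mathcal{F}_i] - E[Y_{n,m} \mid \mathcal{F}_{i-1}], \]
so that for fixed $n, m$ the $D_i^{(n,m)}$ are martingale differences and
\[ Y_{n,m} = \sum_{i=1}^{n} D_i^{(n,m)}. \]

As in Section 4, for $i \in \mathbb{N}$ let $\mathbf{U}_i'$ be an independent copy of $\mathbf{U}_i$. For $i \le n$ let $\mathcal{U}_n^i$ be the sequence $\mathcal{U}_n$ but with $\mathbf{U}_i$ replaced by $\mathbf{U}_i'$. If $i > n$, we take $\mathcal{U}_n^i = \mathcal{U}_n$. Define
\[ \Delta_i^{(n,m)} := [\mathcal{O}^{d,\alpha}(\mathcal{U}_n^i) - \mathcal{O}^{d,\alpha}(\mathcal{U}_m^i)] - [\mathcal{O}^{d,\alpha}(\mathcal{U}_n) - \mathcal{O}^{d,\alpha}(\mathcal{U}_m)]. \]
Then, similarly to before,
\[ D_i^{(n,m)} = -E[\Delta_i^{(n,m)} \mid \mathcal{F}_i]. \]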

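Analogously to before, we decompose $\Delta_i^{(n,m)}$ into six parts. For $i > m$, let $\Delta_{i,1}^{(n,m)}$ be the weight of the edge from $\mathbf{U}_i$, and $\Delta_{i,2}^{(n,m)}$ be the weight of the edge from $\mathbf{U}_i'$. For $i \le m$, set $\Delta_{i,1}^{(n,m)} = \Delta_{i,2}^{(n,m)} = 0$. For all $i$, let $\Delta_{i,\ell}^{(n,m)}$ for $\ell = 3, 4$ be the total weight of edges incident to $\mathbf{U}_i$, $\mathbf{U}_i'$ respectively from $\{\mathbf{U}_{m+1}, \mathbf{U}_{m+2}, \ldots, \mathbf{U}_n\}$. Let $\Delta_{i,5}^{(n,m)}$ be the total weight of edges in the ONG on $(\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_{i+1}, \ldots, \mathbf{U}_n)$ from points in $\{\mathbf{U}_{m+1}, \ldots, \mathbf{U}_n\}$ that are joined to $\mathbf{U}_i$ in the ONG on $\mathcal{U}_n$. Let $\Delta_{i,6}^{(n,m)}$ be the total weight of edges in the ONG on $(\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}, \mathbf{U}_{i+1}, \ldots, \mathbf{U}_n)$ from points in $\{\mathbf{U}_{m+1}, \ldots, \mathbf{U}_n\}$ that are joined to $\mathbf{U}_i'$ in the ONG on $\mathcal{U}_n^i$. Then we have
\[ \Delta_i^{(n,m)} = \Delta_{i,2}^{(n,m)} + \Delta_{i,4}^{(n,m)} + \Delta_{i,6}^{(n,m)} - \Delta_{i,1}^{(n,m)} - \Delta_{i,3}^{(n,m)} - \Delta_{i,5}^{(n,m)}. \]
Note that $\Delta_{i,\ell}^{(n,m)} \ge \Delta_{i,\ell}^{(n,m+1)}$ and $\Delta_{i,\ell}^{(n,0)} = \Delta_{i,\ell}^{(n)}$ as defined in Section 4. Analogously to Lemma 4.2 above, we have the following.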
Lemma 5.3 For any $\alpha > 0$ there exists $C \in (0, \infty)$ such that for all $\ell \in \{1, \ldots, 6\}$
\[ E\big[(E[\Delta_{i,\ell}^{(n,m)} \mid \mathcal{F}_i])^2\big] \le C i^{-2\alpha/d}, \tag{65} \]
for $m < i \le n$, and, for $i \le m \le n$,
\[ E\big[(E[\Delta_{i,\ell}^{(n,m)} \mid \mathcal{F}_i])^2\big] \le C(\max\{m - i, i\})^{-2\alpha/d}. \tag{66} \]

Proof. The argument in Lemma 4.2 carries through, so that (65) holds for all $i$. Indeed, $\Delta_{i,\ell}^{(n,m)} \le \Delta_{i,\ell}^{(n)}$ and so Lemma 4.2 implies (65) for all $i, \ell$. Thus to obtain (66) we need to show that there exists $C \in (0, \infty)$ such that for all $\ell$ and all $i < m \le n$
\[ E\big[(E[\Delta_{i,\ell}^{(n,m)} \mid \mathcal{F}_i])^2\big] \le C(m - i)^{-2\alpha/d}. \tag{67} \]

Thus suppose $i < m$. In this case, we need only consider $\Delta_{i,\ell}^{(n,m)}$ for $\ell \ge 3$, since $\Delta_{i,\ell}^{(n,m)} = 0$ for $\ell \in \{1, 2\}$. First take $\ell = 3$, dealing with the edges incident to $\mathbf{U}_i$. There are $m - i$ points of $\mathcal{U}_n$ with mark (index) greater than $i$ but not more than $m$, and edges from these points to $\mathbf{U}_i$ are not counted in $\Delta_{i,3}^{(n,m)}$. Recall that $V_i$, $V_i'$ is the Voronoi cell of $\mathbf{U}_i$, $\mathbf{U}_i'$ respectively with respect to itself and $\{\mathbf{U}_1, \ldots, \mathbf{U}_{i-1}\}$, and $B_i$ is a minimal-volume $d$-cube with $V_i \subseteq B_i \subseteq (0,1)^d$.

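By an argument analogous to that in the proof of Lemma 4.2, discarding points of $\{\mathbf{U}_{i+1}, \ldots, \mathbf{U}_m\}$ that fall outside $B_i$ can only increase $\Delta_{i,3}^{(n,m)}$. It follows that, with the same notation as in that proof,
\[ \Delta_{i,3}^{(n,m)} = \sum_{j=m+1}^{n} \mathbf{1}_{\mathcal{E}(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_j)} \|\mathbf{U}_j - \mathbf{U}_i\|^{\alpha} \le \sum_{j :\, m+1 \le j \le n,\ \mathbf{U}_j \in B_i} \mathbf{1}_{\mathcal{E}(\mathbf{U}_j, \mathbf{U}_i; \mathcal{U}_{j,i})} \|\mathbf{U}_j - \mathbf{U}_i\|^{\alpha}. \]

Let $M \sim \mathrm{Bin}(m - i, |B_i|)$ be the number of points of $\{\mathbf{U}_{i+1}, \ldots, \mathbf{U}_m\}$ that fall in $B_i$. Thus $\Delta_{i,3}^{(n,m)}$ is stochastically dominated by
\[ \mathcal{O}^{d,\alpha}_{\mathbf{x}}(B_i; n) - \mathcal{O}^{d,\alpha}_{\mathbf{x}}(B_i; M) = |B_i|^{\alpha/d}\big[\mathcal{O}^{d,\alpha}_{\mathbf{x}}(n) - \mathcal{O}^{d,\alpha}_{\mathbf{x}}(M)\big], \]
for some $\mathbf{x} \in (0,1)^d$, by scaling. Hence for some $C \in (0, \infty)$
\[ E[\Delta_{i,3}^{(n,m)} \mid \mathcal{F}_i] \le C|B_i|^{\alpha/d} E[(M + 1)^{-\alpha/d} \mid \mathcal{F}_i] \]
by (15). By Lemma 5.1, $E[(M + 1)^{-\alpha/d} \mid \mathcal{F}_i] \le C|B_i|^{-\alpha/d}(m - i)^{-\alpha/d}$, so that for $i < m$
\[ E[\Delta_{i,3}^{(n,m)} \mid \mathcal{F}_i] \le C(m - i)^{-\alpha/d}. \]
For $\ell = 4$ a similar argument (with $V_i$ replaced by $V_i'$) holds. Thus we obtain (67) for $\ell \in \{3, 4\}$. For $\ell \in \{5, 6\}$ a similar argument applies, using (20) instead of (15) this time. □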

Proof of Theorem 2.2. By Lemma 5.3 and Cauchy-Schwarz we have that for $i \le m$,
\[ E[(D_i^{(n,m)})^2] \le C(\max\{m - i, i\})^{-2\alpha/d}, \]
while for $i > m$, $E[(D_i^{(n,m)})^2] \le C i^{-2\alpha/d}$. Thus for $\alpha > 0$, for $m < n$
\[ E\big[|\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n) - \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_m)|^2\big] = E[(Y_{n,m})^2] = \sum_{i=1}^{n} E[(D_i^{(n,m)})^2] \]
\[ \le C \sum_{i=1}^{\lfloor m/2 \rfloor} (m - i)^{-2\alpha/d} + C \sum_{i=\lfloor m/2 \rfloor}^{m} i^{-2\alpha/d} + C \sum_{i=m+1}^{n} i^{-2\alpha/d}. \tag{68} \]

In particular, for $\alpha > d/2$ the right-hand side of (68) is bounded by a constant times $m^{1-(2\alpha/d)}$, which tends to 0 as $n, m$ tend to infinity. Thus for $\alpha > d/2$, $\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)$ is a Cauchy sequence in $L^2$, and hence as $n \to \infty$ it converges in $L^2$ to some limit random variable $Q(d, \alpha)$, with $E[Q(d, \alpha)] = \lim_{n \to \infty} E[\tilde{\mathcal{O}}^{d,\alpha}(\mathcal{U}_n)] = 0$. Thus we obtain (6).
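The claim that the three sums $\sum_{i=1}^{\lfloor m/2 \rfloor} (m-i)^{-2\alpha/d}$, $\sum_{i=\lfloor m/2 \rfloor}^{m} i^{-2\alpha/d}$ and $\sum_{i=m+1}^{n} i^{-2\alpha/d}$ are together of order $m^{1-(2\alpha/d)}$ when $\alpha > d/2$ can be illustrated numerically. The Python sketch below takes the constant $C$ equal to 1 and, as an assumption made here to avoid a zero base when $m = 1$, starts the middle sum at $\max\{\lfloor m/2 \rfloor, 1\}$; the function names are likewise illustrative.

```python
def cauchy_bound(m: int, n: int, alpha: float, d: int) -> float:
    """Sum of the three series bounding E|O(U_n) - O(U_m)|^2, with C = 1 and m < n."""
    s = 2.0 * alpha / d
    half = m // 2
    first = sum((m - i) ** (-s) for i in range(1, half + 1))
    second = sum(i ** (-s) for i in range(max(half, 1), m + 1))
    third = sum(i ** (-s) for i in range(m + 1, n + 1))
    return first + second + third

def normalised_bound(m: int, n: int, alpha: float, d: int) -> float:
    """The bound divided by m^{1 - 2*alpha/d}; it should stay bounded when alpha > d/2."""
    return cauchy_bound(m, n, alpha, d) * m ** (2.0 * alpha / d - 1.0)

# With alpha = d = 2 the bound itself decays like 1/m, while the normalised version is stable.
for m in (10, 100, 1000):
    print(m, cauchy_bound(m, 10 * m, 2.0, 2), normalised_bound(m, 10 * m, 2.0, 2))
```

For $\alpha \le d/2$ the third sum no longer converges as $n \to \infty$, which is consistent with the Cauchy argument being restricted to $\alpha > d/2$.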

Finally, we prove the Poisson part (7). As before, let $X_n := \mathcal{O}^{d,\alpha}(\mathcal{U}_n)$ and $\mu_n := E[X_n]$. For $N(n)$ Poisson with mean $n > 0$, $\mathcal{O}^{d,\alpha}(\mathcal{P}_n)$ has the distribution of $X_{N(n)}$ and expectation $E[\mu_{N(n)}] =: a_n$. Consider, for $n > 0$,
\[ E\big[|(X_{N(n)} - \mu_{N(n)}) - Q(d, \alpha)|^2\big] \le \sup_{m \ge n/2} E\big[|(X_m - \mu_m) - Q(d, \alpha)|^2\big] + E\big[|(X_{N(n)} - \mu_{N(n)}) - Q(d, \alpha)|^2 \mathbf{1}_{\{N(n) < n/2\}}\big]. \tag{69} \]

For $\alpha > d/2$, the $L^2$ convergence of $X_n - \mu_n$ to $Q(d, \alpha)$ (from (6)) implies that the first term on the right-hand side of (69) tends to zero, and that the second term is bounded by a constant times $P(N(n) < n/2)$, which tends to zero as $n \to \infty$. So, for $\alpha > d/2$,
\[ X_{N(n)} - \mu_{N(n)} \xrightarrow{L^2} Q(d, \alpha), \quad \text{as } n \to \infty. \tag{70} \]

First suppose $\alpha > d$. Here (see Proposition 2.1) $\mu_n \to \mu := \mu(d, \alpha) \in (0, \infty)$ as $n \to \infty$. It follows, by a similar argument to (69), that $\mu_{N(n)}$ converges to $\mu$ in $L^2$ and $a_n = E[\mu_{N(n)}] \to \mu$ also. Thus, with (70), as $n \to \infty$
\[ \tilde{\mathcal{O}}^{d,\alpha}(\mathcal{P}_n) = X_{N(n)} - a_n = (X_{N(n)} - \mu_{N(n)}) + (\mu_{N(n)} - \mu) + (\mu - a_n) \xrightarrow{L^2} Q(d, \alpha). \]

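For $\alpha \in (d/2, d]$, $\mu_n \to \infty$ as $n \to \infty$. Recall (62). With $p_m(n) := P(N(n) = m)$, the middle bracket in (62) satisfies, for $\varepsilon > 0$,
\[ |a_n - \mu_{\lfloor n \rfloor}| \le \sum_{m \in \mathbb{N} :\, |m - n| \le n^{(1/2)+\varepsilon}} |\mu_m - \mu_{\lfloor n \rfloor}|\, p_m(n) + \sum_{m \in \mathbb{N} :\, |m - n| > n^{(1/2)+\varepsilon}} |\mu_m - \mu_{\lfloor n \rfloor}|\, p_m(n). \tag{71} \]

Using the trivial bound $\mu_m \le Cm$, the second sum in (71) is bounded by a constant times
\[ \sum_{m \in \mathbb{N} :\, |m - n| > n^{(1/2)+\varepsilon}} (m + n)\, p_m(n) \le E\big[(N(n) + n) \mathbf{1}_{\{|N(n) - n| > n^{(1/2)+\varepsilon}\}}\big], \]
which by Cauchy-Schwarz is bounded by
\[ (E[(N(n) + n)^2])^{1/2} \big(P(|N(n) - n| > n^{(1/2)+\varepsilon})\big)^{1/2} \to 0, \]
as $n \to \infty$, by standard Poisson tail bounds. The first sum in (71) satisfies
\[ \sum_{m \in \mathbb{N} :\, |m - n| \le n^{(1/2)+\varepsilon}} |\mu_m - \mu_{\lfloor n \rfloor}|\, p_m(n) \le \sup_{m \in \mathbb{N} :\, |m - n| \le n^{(1/2)+\varepsilon}} |\mu_m - \mu_{\lfloor n \rfloor}|, \]
which tends to zero as $n \to \infty$ by (64). Thus for $\alpha \in (d/2, d]$, as $n \to \infty$
\[ |a_n - \mu_{\lfloor n \rfloor}| \to 0. \tag{72} \]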

Also, from Lemma 4.5 we have that, for $\alpha \in (d/2, d]$, $E[|\mu_{N(n)} - \mu_{\lfloor n \rfloor}|^2] \to 0$, so that
\[ \mu_{N(n)} - \mu_{\lfloor n \rfloor} \xrightarrow{L^2} 0. \tag{73} \]
Thus from (62) with (70), (72) and (73) we obtain the result for $\alpha \in (d/2, d]$ also. □